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Abstract 

Proximity catch digraphs (PCDs) are based on proximity maps which yield proximity regions and are 
special types of proximity graphs. PCDs are based on the relative allocation of points from two or more 
classes in a region of interest and have applications in various fields. In this article, we provide auxiliary 
tools for and various characterizations of PCDs based on their probabilistic behavior. We consider the 
cases in which the vertices of the PCDs come from uniform and non-uniform distributions in the region 
of interest. We also provide some of the newly defined proximity maps as illustrative examples. 
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1 Introduction 



The proximity catch digraphs (PCDs) arc a special type of proximity graphs which arc based on proximity 
maps and are used in disciplines where shape and structure are crucial. Examples include computer vision 
(dot patterns), image analysis, pattern recognition (prototype sel ection), g eography and cartography, visual 
perception, biology, etc. Proximity graphs were first introduced bv lXoussaint. ((198ft ). who called them relative 
neighborhood graphs. The notion of relative neighborhood graph has been generalized in several directions 
and all of these graphs are now called proximity graphs. From a mathematical and algorithmic point of 
view, proximity graphs fall under the category of computational geometry. 

In recent years, a new classification and spatial pattern analysis app roach which i s base d on the relative 
positions of the data points from various classes has been developed. iPriebe et al.l (|200l[ ) introduced the 
class cover catch digraphs (CCCDs) and gave the exact and the asymptotic distribution of the domination 
number of the CCCD based on two dat a sets Xn and ym both of which are random samples from uniform 



distribution on a compact interval in R. iDeVinnev et all (|2002l) I iMarchette and Prie b3 (|2003f ). [Priebe et al.i 
(l2003bh . iPriebe et all (l2003al) . and lOeVinnev and Priebd (|2006h applied the concept in higher dimensions 
and demonstrated relatively good performance of CCCD in classification. The employed methods involve 
data reduction (condensing) by using approximate minimum dominating sets as prototype sets, since finding 
the exact minimum dominating set is in general an NP-hard problem — in particular, for CCCD — (see 
DeVinne"vl (|2003)). Furthermore, the exact and the asympto tic distribution of the domination number of the 



CCCDs are not analytically tractable in higher dimensions. ICevhanI (j2004l ) extended the concept of CCCDs 
by introducing PCDs, which do not suffer from some of the shortcomings of CCCDs in higher dimensions. In 
particular, two new types of PCDs (namely, proportional- edge and central similarity PCDs) are introduced; 
distribution of the domination number of prop ortional-edge PCDs is calculated , and is applied in testing 
spatial patterns of segregation and association (jCevhan and Prie 3 (I2005l . l2007l) ). The distributions of the 



relativ e ar c density of these PC D families are also derived and used for the same purpose (jCevhan et al 
(|2006l) and lCevhan et all (|2007t) ). 

A general definition of proximity graphs is as follows: Let V be any finite or infinite set of points in . 
Each (unordered) pair of points {p,q) ^ V x V is associated with a neighborhood *Jt(p, q) C R''. Let *P be a 
property defined on 01 = {01(p, q) : {p, q) € V^x F}. A proximity (or neighborhood) graph G<yi,<:p{V, E) defined 
by the property *p is a graph with the set of vertices V and the set of edges E such that (p, g) e i? iff 01(p, q) 
satisfies property *p. Examples of most commonly used proximity graphs are the Delaunay tessellation, the 
boundary of the convex hull, the Gabriel graph, relative neighbo rhood graph, Euclidean minimum spanning 
tree, and sphere of influence graph of a finite data set. See, e.g., Jaromczvk and Toussaint ( 1992h . 

The relative allocation of the data points are used to construct a proximity digraph. A digraph is a 
directed graph, i.e., a graph with directed edges from one vertex to another based on a binary relation. 
Then the pair {p,q) G V x V is an ordered pair and {p, q) is an arc (direct ed edge) denoted pq to r eflect 
its difference from an edge. For example, the nearest neighbor (di)graph in Paterson and Yaol ( 1992| ) is a 
proximity digraph. The nearest neighbor digraph, denoted NND{V), has the vertex set V and pq an arc 
iff d{p,q) = min„gy\{p} d{p,v). That is, pq is an arc of NND{V) iff g is a nearest neighbor of p. Note 
that if pq is an arc in NND{V), then {p, q) is an edge in RNG{V). Our PCDs are based on the property 
!P that is determined by the following mapping which is defined in a more general space than M."^. Let 
{i},M) be a measurable space. The proximity map N{-) is given hy N : ft ^ p{^), where p(-) is the 
power set functional, and the proximity region of x G il, denoted N(x), is the image of x G fl under N{-). 
The points in N{x) are thought of as being "closer" t o x G fl than are the points in \ N{x). Proximity 
maps are the building blo c ks of the proximity graphs of Toussaint (jl980l ) ; an extensive survey is available in 
Jaromczvk and Toussaint! (|l992[ ). 

The PCD D = (V,^) has the vertex set V = {pi,P2, ■ ■ ■ iPn} and the arc set A is defined by piPj £ A 
iff G N{pi) for i ^ j. Notice that D depends on the proximity map N{-), and if pj G N{pi), then N{pi) is 
said to catch pj . Hence the name proximity catch digraph. If arcs of the form piPi (i .e., loops) were allowed, 
D would have been called a pseudodigraph according to some authors (see, e.g., IChartrand and Lesniak 



2 



I 1996h ). 

In this article, we provide a probabilistic characterization of the proximity maps, and the associated 
regions and PCDs, and introduce auxiliary tools for the PCDs. We define the proximity maps and data- 
random PCDs in Section[21 describe the auxiliary tools (such as edge and vertex regions) for the construction 
of PCDs in Section[31 provide Fi-regions and the related concepts for proximity maps in SectionlU discuss the 
examples of proximity maps in Delaunay triangles in Section [51 and provide the transformations preserving 
uniformity on triangles in in Section [5l We investigate the characterization of proximity regions and the 
associated PCDs in Section [3 introduce Ffe-regions for proximity maps in r(3^3) in Section [51 K-valucs for 
the proximity maps in T (3^3) in Section [HI and provide discussion and conclusions in Section [TUl 



2 Proximity Maps and Data- Random PCDs 



Let Xn = {^1^X2^ . . . , Xn} and X„ = {Yi, • ■ • 7 ^m} be two data sets from classes X and 3^ of O- valued 
random variables whose joint pdf is fx,Y- Let : x — *■ [0,oo) be a distance function. The class 

cover problem for a target class, say X^, refers to finding a collection of neighborhoods, N{Xi) around 
X, g Xn such that (i) X,, C (Ui N{Xi)) and (ii) 3^,„ n (U, N{X,)) = 0. A collection of neighborhoods 
satisfying both conditions is called a class cover. A cover satisfying condition (i) is a proper cover of class 
X while a collection satisfying condition (ii) is a pure cover relative to class 3^. From a practical point of 
view, for example for classification, of particular interest are the class covers satisfying both (i) and (ii) 
with the smallest collection of neighborhoods, i.e., minimum cardinality cover . This class cover problem 
is a generalization of the set cover problem in Garfinkel and Nemhauseii ( 19721 ) that emerged in statistical 
pattern recog nition and machine le arning, where an edited or condensed set (prototype set) is selected from 
Xn (see, e.g.. iDevrove et al.l (|l996i )'). 

In particular, we construct the proximity regions using data sets from two classes. Given C fi, the 
proximity map Ny{-) associates a proximity region Ny{x) C with each point x G il. The region Ny{x) 
is defined in terms of the distance between x and 3^m- More specifically, our proximity maps will be based 
on the relative pos ition of points from class X with respect to the Delaunay tessellation of the class y. See 
Okabe et al. I (|2000l) and lCevhan (I2OO9I) for more on Delaunay tessellations. 

If Xn is a set of f2-valued random variables then Ny (Xi) are random sets. If Xi are independent identically 
distributed then so are the random sets Ny{Xi). We define the data-random PCD D = (V, A) — associated 



with Ny{-) — with vertex set V = Xn and arc set A by XiXj g A 



Xj e Ny{X,). Since this 



relationship is not symmetric, a digraph is needed rather than a graph. The random digraph D depends on 
the (joint) distribution of the Xi and on the map Ny{-). 

The PCDs are closely related to the proxim ity graphs of Jaromczvk and ToussaintI ( 1992) and m i ght b e 
considered as a special case of covering sets of Tuza (1994) and intersection digraphs of Sen et al. ( 1989| ). 
This data random proximity digraph is a vertex-random proximity digraph which is not of standard type. 
The randomness of the PCDs lies in the fact that the vertices arc random with joint pdf fx,Y, but arcs XiXj 
are deterministic functions of the random variable Xj and the set Ny{Xi). 



For example, the CCCD of iPriebe et al.l (j200ir ) can be viewed as an example of PCD with Ny{x) ^ 
B{x,r{x)), where r{x) :— miuygj;^ d{x,y). The CCCD is the digraph of order n with vertex set Xn and an 
arc from Xi to Xj iff Xj g B{Xi,r{Xi)). That is, there is an arc from Xi to Xj iff there exists an open ball 
centered at Xi which is "pure" (or contains no elements) of y„i, and simultaneously contains (or "catches") 
point Xj. 



3 Auxiliary Tools for the Construction of PCDs in 

Recall the proximity map (associated with CCCD) in R is de fined as B(x, r (x)) w here r{x) = unTiy^y^ d{x, y) 
with d{x, y) being the Euclidean distance between x and y ( Priebe et al. ( 200l( )). Our goal is to extend this 
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idea to higher dimensions and investigate the associated digraph. Now let = {Vi 7 y2, Ym} C R''. 
For d = 1 the proximity map associated with CCCD is defined as the open ball Ns{x) :— B{x,r{x)) for 
all a; e M \ 3^,„ and for x G 3^m, define Ns{x) = {x}. Fm'thermore, dependence on y„i is through r{x). 
Hence Ns{x) is based on the intervals = (y(i-i):mi Yi-.m) for « = 1, 2, . . . , (m + 1) with yo:m = ^00 and 
y(m+i):m = whcrc Yiim IS the i^^ order statistic in 3^^. This intervalization can be viewed as a tessellation 
since it partitions Ch (3^™), the convex hull of ■ For d > 1, a natura l tessellation that partitions Ch (3^m) 
is the Delaunay tessellation (see lOkabe et al. I (I2OOOI) and lCevhanI (I2OO9I) ). Let % for i = 1, 2, . . . , J be the i 
Delaunay cell in the Delaunay tessellation of y„i. In R, we implicitly use the cell that contains x to define 
the proximity map. 

A natural extension of the proximity region Ns{x) to multiple dimensions (i.e., to with d > 1) is ob- 
tained by the same definition as above; that is, Ns{x) := B{x,r{x)) where r{x) := minyg3;^ d{x,y). Notice 
that a ball is a sphere in higher dimensions, hence the name spherical proximity map and the notation Ns- 
The spherical proximity map Ns{x) is well-defined for all a:; £ R"^ provided that 7^ 0- Extensions to R^ 
and hig her dimensions w i th the spherical proximity map — with applications in classification — are investr 
gated in 



DeVinnev et all (|2002| ) . iDeVinnev and Wiermaiil (l2003l) . [Marchette and Priebd (|2003l). |Priebe etH 



Priebe et all (|2003bf) . and lDeVinnev and Priebd (2OO6I) . However, finding the minimum dominating 



set of the PCD associated with Ng{-) is an NP-hard pro blem and the distribution of the domination number 
is not analytically tractable for d > 1 (jCevhan This drawback has motivated us to define new 

types of proximity maps in higher dimensions. Note that for d = f , such problems do not occur. ICevhan 
( 2009f ) states some appealing properties of the proximity map Ns{x) = B{x,r(x)) in R and uses them as 
guidelines for extending proximity maps to higher dimensions and defining new proximity maps. After a 
slight modification, the spherical proximity maps gives rise to arc-slice proximity maps which is defined 
as Nas{x) ■= B{x,r{x)) H T (y^) for x G T (y^) (i.e., when the arc-slice proximity region is the spherical 
proximity region restricted to the Delaunay triangle x lies in). However, for x ^ Cniym) (i-e-, x is not in 
any of the Delaunay triangles based on 3^m), Nas{x) is not defined. 

For X G li, Ng{x) = iff a; = (y(i_i):„i + Yi-.m) /2. We define an associated region for such points in the 
general context. 



Definition 3.1. The superset region for any proximity map N{-) in O is defined to be ^s(7V) := {x G 17 : 
N{x) = fi}. When X is f]-valued random variable, then we assume X G ^s{^) if ^i^) = ^ a-S- Q 



For example, for 17 C R with i = l,2,...,(m - I), ^siNs) := {x € I^ : Ns{x) = I,} = 

{(y(i-i):m +yz:m) /2}, and for i = 0, m (i.e., 17 = /q or 17 = /,„, then ^s(iVs) = since Ns{x) C 1^ 
for all X G li ioT i = 0, m. More generally for 17 = 7^ C M'^ (i.e., i*'' Delaunay cell), ^s{Ns) {x G % : 
Ns{x) = %}. Note that for x G li, X{Nsix)) < X{h) and X{Nsix)) = X{I^) iffx G ^s{Ns) where A(-) is the 
Lebesgue measure on R (also called as R-Lebesgue measure). So the proximity region of a point in ^s{Ns) 
has the largest R-Lebesgue measure. Note that for y^ ~ {yi,y2, ■ • ■ ,ym} C R (i.e. 17 = R), ^%s{Ns) — 0, 
since Ns{x) C for all x € li so Ng{x) C R for all a; G R. Note also that given y^, ^si^s) is not a 
random set, but I{X G Ms{Ns)) is a random variable. 



3.1 Vertex and Edge Regions 

In R, the spherical proximity maps are defined as open intervals where one of the endpoints is in ym- 
In particular, for a; G I, = {yt:m,Y(i+i):m) for i l,2,...,m, Ns{x) (yi:m,y,:m 4- 2r(a;)) for all x G 
(yi:m,y(i-^i):Tn+yrm)/2) whcrc r(x) = d{x,y^■.ra) andiV5(a;) = (y(i+i):m,y(i+i):m-2r(a;)) for all a; G (y(i_l):m-|- 
yrm)/2, y(i+i):,„) where r{x) = d(a;,y(i+i):m). Hence there are two subinterval in li each touching an edge 
and the midpoint of the interval, and Ns{x) depends on which of these regions x lies in. 

In R'' with d > 1, intervals become Delaunay tessellations, and our proximity maps are based on the 
Delaunay cell % that contains x. The region Ny(x) will also depend on the location of x in 7^ with respect 
to the vertices or faces (edges in R^) of 7^. Hence for Ny{x) to be well-defined, the vertex or face of % 
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associated with x should be uniquely determined. This will give rise to two new concepts: vertex regions 
and face regions {edge regions in R^). 

Let 3^3 = {yi,y2,y3} be three non-coUinear points in and T {y^) = T(yi,y2,y3) be the triangle with 
vertices 3^3. To define new proximity regions based on some sort of distance or dissimilarity relative to the 
vertices 3^3, we associate each point in T (3^3) to a vertex of T (3^3). This gives rise to the concept of vertex 
regions. 

Definition 3.2. The connected regions that partition the triangle, T(3^3), (in the sense that the pairwise 
intersections of the regions have zero M^-Lebesgue measure) such that each region has one and only one 
vertex of T (3^3) on its boundary are called vertex regions. □ 



This definition implies that we have three vertex regions. In fact, we can describe the vertex regions 
starting with a point M g \ 3^3 as follows. Join the point M to a point on each edge by a curve such 
that the resultant regions satisfy the above definition. We call such regions M -vertex regions and denote the 
vertex region associated with vertex y as Ruiy) for y G 3^3. Vertex regions can be defined using any point 
M e \ 3^3 by joining M to a point on each edge. In particular, we use a cente r of T (3^3) as the starting 
point M for vertex regions. See the discussion of triangle centers in (|Cevhanl(l2009l) ) with relevant references. 
We think of the points in i?M(y) as being "closer" to y than to the other vertices. It is reasonable to require 
that the area of the region i?A/(y) gets larger as d{M, y) increases. Unless stated otherwise, M- vertex regions 
will refer to regions constructed by joining M to the edges with straight lin e segments. Vertex regions with 
circumcenter, incenter, and center of mass are investigated in lCevha For example M- vertex regions 

can be constructed with M € T(3^3)° by using the extensions of the line segments joining y to M for all 
y e 3^3. See Figure [T] (left) with M = Mc- 




Figure 1: The CM- vertex regions (left) and CM-edge regions (right) with median lines. 

We can also view the endpoints of the interval li as the edges of the interval which suggests the concept 
of edge regions. 

Definition 3.3. The connected regions that partition the triangle, T(3^3), in such a way that each region 
has one and only one edge of T (3^3) on its boundary, are called edge regions. □ 

This definition implies that we have exactly three edge regions. In fact, we can describe the edge regions 
starting with M in T (3^3)°, the interior of T (3^3). Join the point M to the vertices by curves such that the 
resultant regions satisfy the above definition. We call such regions M-edge regions and denote the region 
for edge e as RM{e) for e S {61,62,63}. Unless stated otherwise, Af-edge regions wifi refer to the regions 
constructed by joining M to the vertices by straight lines. In particular, we use a center of T(3^3) for the 
starting point M as the edge regions. See Figure [1] (right) with M = Mc. We can also consider the points 
in RM{e) to be "closer" to e than to the other edges. Furthermore, it is reasonable to require that the area 
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of the region Rm{g) gets larger as d{M,e) increases. Moreover, in higher dimensions, the corresponding 
regions are called "face regions" . 

Edge regions for incenter. center of mass, and orthocenter are investigated in (jCevhan (I2009M 



4 Fi-Regions for Proximity Maps and the Related Concepts 

For any set i? C SI, the Ti-region of B associated with N{-), is defined to be the region ri{B,N) := {z E 

: B C N{z)}. For x € fi, we denote ri({a;},A^) as ri{x,N). Note that Fi-region is based on the 
proximity region N{-). If = {Xi, X2, . . . , X^} is a set of J7-valued random variables, then Fi {Xi, N), 

1 — 1,2, ... ,n are random sets. If the X.^ are independent and identically distributed, then so are the random 
sets Fi {Xi,N). Additionally, Fi {Xn,N) is also a random set. 

In a digraph D = (V,^), a vertex v €V dominates itself and all vertices of the form {u : {v, u) G A}. A 
dominating set Sd for the digraph D is a subset of V such that each vertex w e V is dominated by a vertex in 
Sd- a minimum dominating set S'^ is a d ominating set of minimum cardinality and the domination number 
"f{D) is defined as 7(1?) := 15^1 (see, e.g., Lee (1993)) where | • | denotes the set cardinality functional. 



For Xi,X2, . . . , Xn *~ F the domination number of the associated data-random PCD, denoted jn{N), is 
the minimum number of points that dominate all points in Xn. Note that, 7n(A^) = 1 iff nFi {Xn, N) ^ 0. 
Hence the name V \-region. Suppose /i is a measure on fi. Following are some general results about Fi(-, A^). 

Proposition 4.1. For any proximity map N and set B QQ,, ^g{N) C Fi(i3, A^). 

Proof: For x G Ms{N), N{x) = U., so B <Z N{x) since B C n. Then a; G Ti{B,N), hence ^s(A) C 
Fi(S,A). ■ 

Lemma 4.2. For any proximity map N and B C SI, Ti(B, N) = HrceB ^1(2:, A). 

Proof: Given a proximity map A^ and subset B C il, y £ Ti{B, N) iff S C N{y) iff a; G N{y) for all a; G B 
iff y G Fi(a;, A) for all a; G B iff y G fixes ri(a;, A^). Hence the result follows. ■ 

Corollary 4.3. For any proximity map N and a realization Xn = {xi, X2, . . . , a;„} from F with support 
S{F) cn, Fi {Xn,N) = (X=,ri (x„A). 

A problem of interest is finding, if possible, a subset ofi?, say G C B, such that Fi(i3, A) = f]^^fjTi{x, N). 
This implies that only the points in G are used in determining Fi(_B, A). 

Definition 4.4. An active set of points Sa{B) C il for determining Fi(i?, A^) is defined to be a subset of 
B such that Fi(B, A) = 0^65^(3) ^1(2;, □ 

This definition allows B to be an active set, which always holds by Lemma [4.21 If B is a set of finitely 
many points, so is the associated active set. Among the active sets, we seek an active set of minimum 
cardinality. 

Definition 4.5. Let _B be a set of finitely many points. An active subset of _B C SI is called a minimal active 
subset, denoted S'^(-B), if there is no other active subset Sa of B such that Sa{B) C S^(B). The minimum 
cardinality among the active subsets of B is called the rj-value and denoted as ri{B, N). An active subset of 
cardinality rj{B,N) is called a minimum active subset denoted as Sm{B); that is, ri{B,N) := |S'm(-B)|. □ 



Note that Definitions 14.41 and 14.51 can be extended for any subset B C S7, in a similar fashion. Moreover, 
a minimal active set of minimum cardinality is a minimum active set. We will suppress the dependence on B 
for SAiB), Sfi{B), and ^^/(-B) if there is no ambiguity. In particular, if £? = Xn is a set of SI- valued random 
variables, then Sa and Sm are random sets and rin{N) is a random quantity. 
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For example, in R with 3^2 — {0, 1}, and Xn a random sample (i.e., set of iid random variables) of size 



n > 1 from F whose support is in (0, 1), Fi (AVi, Ns) 



where Xi-n is the r largest value 



2 ' 2 

in Xn. So the extrema (minimum and maximum) of the set are sufficient to determine the Fi-region; i.e., 
Sm = {^Xi-n, Xn:n} ■ Then rjn {Ns) = 1 + !(« > 1) a.s. for Xn being a random sample from a continuous 
distribution with support in (0, 1). 

In the multidimensional case there is no natural extension of ordering that yields natural extrema such 
as minimum or maximum. Some extensions of ordering are proposed under the title of "statistical depth" 
(see for example Liu et al. ( 1999[ )) which is not pursued here. To get the minimum active sets associated 
with our proximity maps, we will resort to some other type of extrema, such as, the closest points to edges 
or vertices in T (3^3). 

For any proximity map N and Xn , rjn (N) < n follows trivially, since 



r,n{N) min | |^| : Fi(A'„,iV) = f| Fi(Z) I 



Lemma 4.6. Given a sequence of ^-valued random variables Xi,X2, ■ ■ ■ from distribution F, let Xin) := 
Xin — 1) U {^n} for n = 0, 1, 2, . . . with X{Q) := 0. Then Fi {X{n), N) is non- increasing in n in the sense 
that Fl {X{n + 1), N) C Fi iX{n), N). 



Proof: Given a particular type of proximity map N and a data set X{n) = {^1, X2, ■ ■ 
m Ti iX{n), N) = nr=i ri iX^,N) and by definition, X{n + 1) = X{n) U {Xn+i}. So, 



1 Xn } , by Lemma 



n+l 



Fl {X{n + 1),N) 



fl Fi(X„iV) = 

71 



fl Fl {Xn+i,N) = Fl {X{n), N) n Fi {Xn+i,N) C Fi {X{n),N) 



Thus we have shown that Fi {X{n), N) is non-increasing in n; i.e., Fi {X{n + 1),N) C Fi {X{n), N). ■ 

Remark 4.7. By monotone sequential continuity from above ( Billingslev ( 1995f) ). the sequence {Fi {X{n), N)]^^^ 
has a limit 



Gi := nFi(A'(j),7V)= hm nFi(^(j),iV)= lim Fi(A'(m),7V) 



(1) 



lim nFi(X„7V) = nFi(X„7V). □ 

rn — ^00 ' ' ' ' 



1=1 1=1 

Theorem 4.8. Given a sequence of random variables Xi,X2, ■ ■ ■ which are identically distributed as F on 
n, let X{n) := X{n - 1) U {X„} with X{0) := 0. Then Fi {X{n), N) j ^s(iV), as n ^ 00 a.s. in the sense 
that Fl {X{n + 1), N) C Fi {X{n),N) and n{Ti {X{n), N) \ Ms{N)) [ a.s. 

Proof: By Lemma l4!6l and by monotone sequential continuity from above, {Fi (^"(71), iV)}J^i has a limit, 
namely, Gi in Equation ([1]). We claim that Gi = ^s{N) a.s. 

Suppose ^s{N) C n, since if ^siN) = ft then N{x) = n for all x £ fl, so Fi {X{n),N) = n for 
aU X hence the res ult would fol low trivially Since I%s{N) C Fi {X{n),N) for aU n, Ms{N) C Gi. From 
Proposition 5.6. in iKaril (|l992l ). y„ ^ F iff Ve > 0, lim„^oo P (supfc>„ |Yfc - y| > e) = 0. Let e > 0. 
Then 

p(supM(nri(^^,^)\^s(iV)) <e) = PU(nFi(X„iV)\^5(A^) 1 < e ) ^ 1 



, k>n 



\i=l 
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as ri ^ oo, because if Fi {X{n),N) \ Ms{N) had positive measure, then for each y G Fi {X{n),N) \ ^s{N), 
fl \ N{y) will contain data points from X(k) with positive probability for sufficiently large k > n. So y can 
not be in Fi {X{n), N), which is a contradiction. Hence the desired result follows. ■ 

Note however that Fi {Xn, N) is neither strictly decreasing nor non- increasing provided that ^s{N) 7^ fl 
for all Xn, because we might have Fi {Xn, N) C Fi {Xm, N) for some rn > n. Nevertheless, the following two 
results hold. 

Proposition 4.9. Suppose fl \ ^s(iV) has positive measure. For positive integers m > n, let Xn} and X^ 
be two samples from F on f2. Then /u(Fi {X^, N)) <^'^ Ai(Fi N)). 

Proof: Recah that for X - F and Y ^ G, X <^'^ Y if F{x) > G{x) for all x with strict inequahty holding 
for at least one x. 

Let m > n and Xn and X,n be two samples from F. Then /i(Fi iV)) is more often smaller than 
j u(Fi (Xn,N)) . Hen ce P[^{Ti {X„i, N)) < /i(Fi A^))] > 1/2 which only shows stochastic precedence 
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Now, let t e {fi{^s{N)),fi{^)), then /i(Fi (A'„,7V)) < t happens more often than ^(Fi TV)) < t, 
hence P {niTi (A"™, TV)) <t)>P (/i(Fi TV)) < t); that is, F^{t) > F„(i), where F,(-) is the distribution 
function for ^(Fi {Xi,N)) for i — m,n. For t < ijl{Ms{N)) or t > yu(ri), Fi{t) = for i = m,n. Letting 
iV„ := \Xn \ ^s{N)\ and iV,„ := \X„, \ ^s{N)\, then P(7V„ ^ N„,) > since \ Ms{N)) > 0. In fact, 
P{N,n > Nn) > 1/2. But if Fm{t) = Fn{t) for aU t were the case, then P(7V„ = N,n) = 1 would hold, which 
is a contradiction. ■ 

Theorem 4.10. Let {Xn\'^^i be a sequence of samples of size n from distribution F with support on fl. 
Then Fi {X,,, N) ^ ^s{N) m the sense that ^(Fi TV) \ Ms{N)) as 00. 

Proof: Given a sequence {Xn}^^i as in the theorem. By Proposition l4.11 Ms{N) C Fi {Xn,N) for each n. If 
Fi {Xn, N)\Ms{N) has zero measure as n — > cx) then result follows trivially. Otherwise, if Fi (A^„, N)\Ms{N) 
had positive measure in the limit, for each y e lim„_,oo Fi {Xm N) \ ^s{N), \ N{y) would have positive 
measure with positive probability, then Xn r\[Q \ N{y)] 7^ with positive probability for sufficiently large n, 
then y ^ Ti {Xn, N), which is a contradiction. ■ 

Since Fi {Xn, N) = n"=i Ti {Xi, N) for a given realization of the data set Xn, first we describe the region 
ri{X,N) for X e Xn, and then describe the region Fi {Xn,N). 

Theorem 4.11. If the superset region for any type of proximity map N has positive measure (i.e., fi{lMs{N)) > 
0/, then P{"fn{N) = 1) ^ 1 as n-^ 00. 

Proof: Notice that if there is at least one data point in ^s{N) then 7n(A^) = 1, because any point 
X g Ms{N) wiU have N{x) = so P(there is at least 1 point in ^5(iV)) < P{^n{N) = 1). Now, 

P(there is at least 1 point in Ms{N)) ^ 1 - P{Xn n = 0) = 1 - { ^^^^ ^ ^j^^n)'^^^^^ ^ ' ^^"^^^ ^^^^ 
to 1 as n ^ 00. Hence P(7„(A^) = 1) ^ 1 as n ^ 00. ■ 

The relative arc density of a digraph D ~ (V,.4) of order |V| = n, denoted as p{D), is defined as 



1^1 J 1 . 

p{D) = — ^ — where | • | denotes the cardinality of sets (jjanson et al.l ([20001)). Thus p{D) represents the 



ratio of the number of arcs in the digraph D to the number of arcs in the complete symmetric digraph of 
order n, which is n{n — 1). 

Theorem 4.12. // support of the joint distribution of Xn is subset of the superset region for any type of 
proximity map, then relative arc density p{Xn) = 1 a.s. 

Proof: Suppose the support of Xn is a subset of the superset region, then the corresponding digraph is 
complete with n{n — 1) arcs. Hence the relative arc density is 1 with probability 1. ■ 
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5 Transformations Preserving Uniformity on Triangles in M? 



The proximity regions and hence the corresponding PCDs are based on the Delaunay tessellation of 3^,„, 
which partitions Cniym)- So, suppose the set is a set of iid uniform random variables on the convex 
hull of i-e., a random sample from U{Ch (3^m))- In particular, conditional on \Xn H T^j > being fixed, 
Xn n Ti will also be a set of iid uniform random variables on for i S {1, 2, . . . , J}, where Ti is the i*^ 
Delaunay cell and J is the total number of Delaunay cells. Reducing the cell (triangle in M^) Ti as much 
as possible while preserving uniformity and the probabilities related to PCDs will simplify the notation and 
calculations. For simplicity we consider only. 

Let 3^3 = {yi, y2, ya} C be three non-coUinear points and T (y^) be the triangle (including the interior) 

with vertices yi,y2,y3- Let Xi U{T (y^)), the uniform distribution on r(3^3), for i = 1, 2, . . . , n. The 
probability density function (pdf) oiU{T (y^)) is 

/H = ^4(^i(-eT(m 

where A{-) is the area functional. 

The triangle T (y^) can be carried into the first quadrant by a composition of transformations (scaling, 
translation, rotation, and reflection) in such a way that the largest edge has unit length and lies on the 
X-axis, and the x-coordinate of the vertex nonadjacent to largest edge is less than 1/2. We call the resultant 
triangle the basic triangle and denote it as Ti, where Tb = ((0, 0), (1, 0), (ci, C2)) with < ci < 1/2, and 



C2 > and (1 — ci)^ + C2 < 1. The transformation from any triangle to Tf, is denoted by </){,. See ICevhan 
:br a detailed description of </);,. Notice that T (y^) is transformed into Tb, then r(3^3) is similar to 



Tb and 4>b (T (3^3)) = T^. Thus the random variables X, ~ U(T (3^3)) transformed along with T (3^3) in the 

described fashion by if)i, satisfy ipb{Xi) U(Tb). So, without loss of generality, we can assume T (3^3) to be 
Tb for uniform data. The functional form of Tb is 



Tb 



{(x, y) e M2 : y > 0; 2/ < (C2 x)lci- 2/ < C2 (1 - x)/(l - ci)} . 



There are other transformations that preserve uniformity of the random variable, but not similarity 
of the triangles. We only describe the transformation that maps Tb to the standard equilateral triangle, 
Tg = T ((0, 0), (1, 0), (1/2, -^3/2)) for exploiting the symmetry in calculations using Tg. 

Let 06 : (x, y) (u, v), where u(x, y) = x -\ ^=-^ y and v(x, y) = y. Then yi is mapped to (0, 0), 

V3 2c2 

y2 is mapped to (1,0), and ys is mapped to (1/2, V3/2). See also Figure [H 

Note that the inverse transformation is 4>~^{u, v) = [x{u, v), y{u, v)) where x{u, v) = u- 

y{u, v) = u. Then the Jacobian is given by 
v3 



U-2ci) 
V3 



V and 



J{x,y) 



So fu,v{u,v) ^ fx,Y{cb:\u,v))\J\ 



dx 
du 


dx 
dv 




1 


2 ci-l 


dy_ 

du 


dy 

dv 







2C2 

V3 



V3 



2C2 

^/3• 



l((w, u) e Te)- Hence uniformity is preserved. 



Remark 5.1. The probabilities for uniform data in T (3^3) involve the ratio of the event region to A(T (3^3)). 
If such ratios is not preserved under 0e, then the probability content for N depends on the geometry of 
2^(3^3)- In particular, the probability content f or uniform dat a for Ns and Nas depend on the geometry of 
the triangle, hence is not geometry invariant ( Cevhan ( 2009() ). For example, P{X € NsiY)) and P{X G 
Nas{YtM)) depends on (ci,C2), hence one has to do the computations for all of (uncountably many) of 
these triangles. Hence we do not investigate Ns and Nas further in this article. □ 
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6 Sample Proximity Maps in 

Let y„i ~ {yi,y2, • ■ • ,ym} be m points in general position in R"^ and % be the z*'* Delaunay cell for i = 
1, 2, . . . , J, where J is the number of Delaunay cells. Let Xn be a random sample from a distribution F in 
with support S{F) C (3^„0- 

In particular, for illustrative purposes, we focus on R^, where a Delaunay tessellation is a triangulation, 
provided that no more than three points in y,n are cocircular. Furthermore, for simplicity, let 3^3 = {yi, y2, Vs} 
be three non-coUinear points in and T (y^) = r(yi,y2,y3) be the triangle with vertices 3^3. Let Xn be 
a random sample from F with support S{F) C T^y^). In this section, we will describe two families of 
triangular proximity regions. 

6.1 Proportional-Edge Proximity Maps 

The first type of triangular proximity map we introduce is the proportional-edge proximity map. For this 
proximity map, the asymptotic distribution of domination number and the relative density of the corre- 
sponding PCD will have mathematical tractability. 

For r e [1,00], define Np^{-, AI) :~ A^(-, il/; r, 3^3) to be the proportional- edge proximity map with M- 
vertex regions as follows (see also Figure |3] with M = Mc and r = 2). For x G T (y^) \ y^, let v{x) G y^ be 
the vertex whose region contains x; i.e., x € RMiv{x)). If x falls on the boundary of two M-vertex regions, 
we assign v{x) arbitrarily. Let e{x) be the edge of T(3^3) opposite v{x). Let £{v{x),x) be the line parallel 
to e{x) through x. Let d{v{x), i{v{x), x)) be the Euclidean (perpendicular) distance from v{x) to £{v{x),x). 
For r e [1, 00), let ir{v{x), x) be the line parallel to e{x) such that 

d{v{x),£r{v{x),x)) = r d{v{x),£{v{x),x)) and d{£{v{x),x),£riv{x),x)) < d{v{x),£r{v{x),x)). 

Let Tr{x) be the triangle similar to and with the same orientation as T (y^) having v{x) as a vertex and 
£r{v{x),x) as the opposite edge. Then the proportional-edge proximity region Np^{x, M) is defined to be 
Tr{x) n T (3^3). Notice that i{v{x),x) divides the edges of Tr{x) (other than £r{v{x),x)) proportionally with 
the factor r. Hence the name proportional edge proximity region. 

Notice that r > 1 implies x £ N^^{x,M). Furthermore, limr^oc- Np six, M) = T(3^3) for all x € 
T (ys) \ 3^3, so we define N'^j^{x, M)=T (3^3) for all such x. For x G 3^3, we define N'pp,{x, M) = {x} for aU 
r e [1, Qo\. 

Notice that Xi F, with the additional assumption that the non-degenerate two-dimensional probability 
density function / exists with support S{F) C T(3^3), implies that the special case in the construction of 
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Np^ — X falls on the boundary of two vertex regions — occurs with probability zero. Note that for such 
an Np^{X) is a triangle a.s. 




Figure 3: Construction of proportional edge proximity region, Np^{x) (shaded region). 



The functional form of Np^{x, M) for x ~ (i-q, j/o) G is given in lCevhan (I2OO9I) . Of particular interest 



is Np^{x, M) with any M and r <E {\/2, 3/2,2}. For r — ^/2, £{v{x),x) divides T^{x) into two regions of 

equal area, hence N^{x, M) is also referred to as double-area proximity region. For r = 2, l!.{v{x),x) divides 
the edges of T2{x) — other than ir{v{x),x) — into two segments of equal length, hence Np^{x, M) is also 
referred to as double-edge proximity region. For r < 3/2, ^%s{Np^^ Mc) = 0, and for r > 3/2, ^%s{Np^, Mc) 
has positive area; for r = 3/2, S^si^p^, Mc) ~ {Mc}- Therefore, r = 3/2 is the threshold for the superset 
region to be nonempty. Furthermore, r = 3/2 will be the value at which the asymptotic dist r ibutio n of the 
domination number of the PCD based on Np^{-, Mc) is nondegenerate ( Cevhan and Priebd ( 20051 )). 

Let {Np^, M) be the superset region for Np^ based on A/-vertex regions with orthogonal projections. 
Then the superset region with the incenter {Nj,^, Mj) is as in Figure [3] (right). Let Mi be the midpoint 
of edge e^ for i = 1,2,3. Then T(Mi,M2,M3) C ^^{Nj,j^,Mi) for all T {ys) with equality holding when 
T (3^3) is an equilateral triangle. 

For Nj,j^{-, Mc) constructed using the median hues {Np^, Mc) = r(Mi, M2, M3), and for Nj,j^{-, Mc) 
constructed by the orthogonal projections, [Np^, Mc) 3 T(A/i, Af2, M3) with equality holding when 




Figure 4: Superset region {^pe^ Mcc) in an acute triangle (left), superset region, ^g{Np^, Mj) (right) 
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Figure 5: The triangle ,9''=^. 



^'(3^3) is an equilateral triangle. 

In T (3^3), drawing the lines ^i(r, x) such that d{yi,ei) — r d(^i(r, x), y^) for j e {1,2, 3} yields a triangle, 
for r < 3/2 . See Figure [5] for with r = ^2. 

The functional form of in is 

[ r r (1 — ci) r ci J 

(r-l)(l + ci) C2(r-1)\ /2-r + Ci(r-l) C2(r-1)\ /ci(2-r)+r-l Ca (r - 2) 



= T 

There is a crucial difference between and T{Mi, M2, M3): r(A'/i, Afz, M3) C [N'pj^.M) for all M 
and r > 2, but (^'')° and {N^e,M) are disjoint for all A/ and r. 

So if Af e (^^J^O", then (7V|,^, M) = 0; if A/ e 5(5^''), then i^s (iV^^, Af ) = {A-/}; and if AI ^ 
then ^5 {Np^,M) has positive area. The triangle defined above plays a crucial role in the analysis of 
the distribution of the domination number of the PCD based on proportional-edge proximity maps. The 
superset region ^5 [Np^, M) will be important for both the domination number and the relative density of 
the corresponding PCDs. 

The functi onal forms of the superset region, Ms {Np^, M), and T{Mi, M2, M3) in are provided in 
CevhanI ^0^. 



Np^{-,M) is geometry invariant if A/-vcrtex regions are constructed with M G T {y^)" by using the 
extensions of the line segments joining y to A/ for all y G 3^3. B ut when t he ve rtex regions are constructed 
by orthogonal projections, Np^(-, M) is not geometry invariant ( Cevhan ( 2009f )). hence such vertex regions 
are not considered here. 



6.1.1 Extension of Npp, to Higher Dimensions 

The extension to R'' for d > 2 is straightforward. The extension with M = Mc is given her. but the 
extension for general M is similar. Let J^d+i = {yi, y2, • ■ • , Yrf+i} be d + 1 points that do not lie on the same 
d — 1-dimcnsional hypcrplane. Denote the simplex formed by these c? + 1 points as 6(3^^+1). A simplex is 
the simplest polytope in having d + 1 vertices, d (d + l)/2 edges and d + 1 faces of dimension (d — 1). For 
r G [1, 00], define the proximity map as follows. Given a point x in ©(J^^+i), let v := argminyg-y^^^ Y{Qy{x)) 
where Qy{x) is the polytope with vertices being the d(d + l)/2 midpoints of the edges, the vertex v and 
X and V(-) is the d-dimensional volume functional. That is, the vertex region for vertex v is the polytope 
with vertices given by v and the midpoints of the edges. Let v{x) be the vertex in whose region x falls. If 
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X falls on the boundary of two vertex regions, v{x) is assigned arbitrarily. Let (p{x) be the face opposite 
to vertex v{x), and T(v{x),x) be the hyperplane parallel to f{x) which contains x. Let d{v{x),T{v{x), x)) 
be the (perpendicular) Euclidean distance from v{x) to T{v{x),x). For r € [l,oo), let Tr{v{x), x) be the 
hyperplane parallel to ip{x) such that 

d{v(x), Tr{v{x), x)) = r d{v{x), T{v{x), x)) and d{T{v{x), x), Tr(v{x), x)) < d{v{x), Tr(v{x), x)). 

Let &r{x) be the polytope similar to and with the same orientation as 6 having v(x) as a vertex and 
Tr{v{x),x) as the opposite face. Then the proximity region Np^{x, Ale) ■— 6r{x) H ©(J^d+i)- Notice that 
r > 1 imphes x e Np^{x, Mc)- 

6.1.2 Fi-Regions for Proportional-Edge Proximity Maps 

For Np^{-, M), the Fi-region is constructed as follows; see also Figure [51 Let ^i{r,x) be the line parallel to 
d such that 6(r,x) nr(y3) ^ and r d{yi,^i{r,x)) = d{yi, e{yi, x)) for i e {1,2,3}. Then 



Fi {x,N'pE,M)nRM{y.,) = {z e i?M(yO : d{y,,i{y„z)) > d{y„^,{r,x)} for i e {1,2,3}. 

Notice that r > 1 implies x E Fi (x, TVp^, Af ). Furthermore, lim^-^oo Fi (.t, A^p^, A/) = 2^(3^3) for all 
xeT (3^3) \ 3^3 and so we define Fi {x, N^=I?°,M) = T (3^3) for aU such x. For x € 3^3, Ti [x, N^^, M) = {a;} 
for all r S [1, 00]. 
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Fi (x, N'i^E.M) = y [Fi (.T, N-pE, M) n i?M(y.O] 



i=l 



where 




Figure 6: Construction of the Fi-region. Fi [x, N'p-j^.Mc) (shaded region). 



The functional form of Fi {x — (xq, 2/0); ^pe^ ^^'^ basic triangle is given by 
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Fi {x = (xo, yo), N'PE, M) = |J [Fi {x = (xq, yo), A^pb, M) n i?M(2/z)] 
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Figure 7: Ti {x,NIe,Mcc) with x £ Rcciyz) (left), x G i?fc'(y3) (right). 



where 



Fi (a^ = (xo,yo),AfpB,M)ni?M(yi) = |(a^,y) £ i?Af(yi) 
Fi (a: = (xo, yo), A^pb, M) n i?M(y2) = | (a;, v) £ Rniyi) 

Fi (x = (xo, yo), A^PB, M) n i?A/(y3) = j (a;, v) e i?A/(yi) 




C2 (rx - ccq) 'I 
(l-ci)r /' 




C2 (r (x - 1) + 1 - Xo 



} 



yo - C2 (1 - r) 



r 



} 



Notice that Fi (x, -/Vp^;, Mc) is a convex hexagon for aU r > 2 and x ^ T (y^) \ 3^3, (since for such an x, 
Fi (x, Np^,Mc) is bounded by ^i(r, x) and for all i £ {1, 2, 3}, see also Figure[6]) else it is either a convex 
hexagon or a non-convex polygon depending on the location of x and the value of r. 

Furthermore, in M.'^ with d > 2 let Ci{x) be the hyperplanc such that Ci(x)n(5 (3^m) 7^ and r (i(yi, Ci{^)) = 
d(y„?7(y,,x)) for i = 1, 2, . . . , (d + 1). Then Fi (x, TV^^) n i?c a/ (y») = € Rcmiyi) ■ d(y„ ?7(y,, z)) > 
'^(y»,C»(a;)}, for i 1,2,3. Hence Fi (x,iV^£) = Uf=i (^i (2^: ^t^b) H i?CA/(yO). Notice that r > 1 implies 
X £ Np^{x) and x £ Fi (x, Np^)- 

So far, we have described the Fi-region for a point in x £ T (3^3). For a set Xn of size n in T (3^3), the 
region Fi {Xn, Npp,, M) can be exactly described by the edge extrema. 

Definition 6.1. The (closest) edge extrema of a set B in T (3^3) arc the points closest to the edges of T (3^3), 
denoted Xe^ for i £ {1,2,3}; that is, Xg. £ arginf^^^ (i(x, e^). 

Note that if B = Xn is a random sample of size n from F then the edge extrema, denoted . (n) , are 
random variables. 

Proposition 6.2. Let B he any set of n distinct points in T iy^) and x^. £ arginf^.^^ (i(x, e^). For 
proportional- edge proximity maps with M-vertex regions, Fi {B, Np^, M) = nf=i {^en Np^^ M). 

Proof: Given B = {xi, X2, . . . , Xn} in T (3^3). Note that 



and if d(y,, £(y,, x)) < d{y^,£{y^,x')) then N^j^{x,M) C Af|,^(x',M) for aU x,x' £ Ruiyi)- Further, by 
definition x^^ £ argmax^^^ (i(yi, ^^(r, x)), so 

Fi (B, N'pE, M) n i?A/(yO = Fi (xe. , N'pE, M) n i?A/(y») for i £ {1, 2, 3}. 



n 



Fi (B, AT^B, M) n i?A/(yO =0^1 (^», A^PB, M) n i?A/(y.), 
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Furthermore, Ti (B, N^p^, M) ^ ULi [^i (a^e, , N'pe. M) n RM{y^)] , and 



3 

f|ri {xe,,N-pE,M) 



nRM{y^) for ^e {1,2,3}. 



Combining these two results, we obtain Fi {B, Np^, M) = Hj^i {xe^ , Np^, M) . ■ 

From the above proposition, we see that the Fi-region for B as in the proposition can also be written as 
the union of three regions of the form 

Fi (B,7V^s,Af) ni?M(yO = {2 e RM{y^) ■ d(y,;,£(y„z)) > d(y„C.(r,x,J)} for i e {1,2,3}. 

Corollary 6.3. Let X„ be a random sample from a continuous distribution F on T (3^3). For proportional- 
edge proximity maps with M -vertex regions, r/„ (Np^) < 3 with equality holding with positive probability for 
n > 3. 

Proof: From Proposition 16.21 rjn (Np^) < 3. Furthermore, Xe{n) is unique for each edge e a.s. since F is 
continuous, and there are three distinct edge extrema with positive probability. Hence P{rin {Np^) = 3) > 
for n > 3. ■ 

Then Fi {Xn,Np^, M) — ni=i Ti (Xg. , iVp^), where e,; is the edge opposite vertex y^, for i = 1, 2, 3. So 
Fi {Xn,N■'pE,M)r^RM{y^) = {z e RM{y^) ■ d(y„£(y„z)) > c^(y^,6(r,a;eJ)}, for i = 1,2,3. 

Note that P{rin {Np^) = 3) ^ 1 as n ^ 00 for Xn a random sample bomU{T (3^3)), since edge extrema 
are distinct with probability 1 as n — s- cx) as shown in the following theorem. 

Theorem 6.4. Let Xn be a random sample from U(T (3^3)) and let Ec^sin) be the event that (closest) edge 
extrema are distinct. Then P{Ec^3{n)) — > 1 as n 00. 

Proof: Using the uniformity preserving transformation c/ig without loss of generality, one can assume X^, is 
a random sample from U{Te). Observe also that the edge extrema in Th arc mapped into the edge extrema 
in Tg. Note that the probability of edge extrema all being equal to each other is P{Xei{n) = Xe2{n) = 
Xeain)) = l{n = 1). Let £'c,2("-) be the event that there are only two distinct (closest) edge extrema. Then 
for n > 1, 

P{E,,2{n)) = P{X,An) = X,,{n)) + P{X,,{n) = X,,{n)) + P{X,Jn) = X,,{n)) 

since the intersection of events Xe-^{n) ~ Xe^ln)), Xei(n) = Xe^in), and Xe2(n) = X^^in) is equivalent to 
the event Xf.-^(n) = Xe2{'n) = Xe3(n). Notice also that P{Ec,2{n ~ 2)) = 1. So, for n > 2, there are two or 
three distinct edge extrema with probability 1; i.e., P{Ec;i{n)) + P{Ec.2{n)) = 1 for n > 2. 

We will show that P{Ec,2{n)) ^ as n ^ 00, which will imply the desired result. 

First consider P{XQ^{n) ~ XQ^in))- The event Xei(n) = Xp^in) ~ X^ ~ {X,Y) is equivalent to the 
event that A'„ C {?/ e Te : d(yi, ^(yi, C/)) < (i(yi, ^(yi, X^)), d(y2, ^(y2, C/)) < d(y2, ^(ya, Xe))}. For example, 
if given Xe^ {n) = Xf^^ ("-) = {^i u) the remaining n — 1 points will lie in the shaded region in Figure[S] (left). 
For other pairs of edge extrema, see Figures [S] (right) andl^l 

The pdf of such X^ = {X,Y) is f{x,y) = n (4/V3) (?;2/V3)""\ Let e > 0, by Markov's inequality 
P (^/3/2 - r > e) < E [V3/2 - Y] /e. But, 



E 



\/3/2-rJ = J J {y3/2- yj n [y^/VSj [i/VSjdydx 

(V3/2 - n [yyVi) (4/x/3j dy 
1 

n{4:n^ - 1)' 



dx 



1/2 Jo 
%/3/4 
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which converges to as n s- oo. So if Xe-^{n) = Xe2{n) were the case, then geometric locus of this point 
goes to Vs. That is. for each e > 0, P (a/S/S — Y>e) -^Oasri-^ oo. Hence P{Xei{n) ~ Xe2{n) ^ ya) — * 
as n ^ oo. Furthermore, P{Xf,^{ri) = Xe^^n) = ya) < P(Xe2(n) G 62) = for aU n > 1. So P{Xf.^{n) = 
Xe2 (n-)) — ^ as n — > 00. 

Likewise, by symmetry, it follows that lim„^oo P{Xex{n) = Xe3{rL)) = lim„^oc P{Xp2{n) = Xe.^{n)) = 0. 
Hence P(£'c,2('^)) Q as n —* 00. Thus P(£'c,3(n)) ^ 1 as n ^ 00. ■ 

The above theorem implies that the asymptotic distribution of r/n (Np^) is degenerate with P{i]n (^p^;) = 

3) — > 1 as ri 00. But for finite n, rjn{Np^) for Xi U{T {y^)) has the following non-degenerate 



where n2{n) G (0, 1) is the probability of edge extrema for any two distinct edges being concurrent. 



Figure 8: The figure for Xei{n) ~ Xe2in) = {x,y) (left) and Xe2{n) = Xe^in) = {x,y) (right). 

Remark 6.5. If Xn is a random sample from F such that S{F) D {a; G T (3^3) : c?(x, e;) < ei} has positive 
measure and S{F) n B{\/i,e2) = for some £1,22 > 0, then P{Ec^3{n)) ^ 1 as n ^ 00 follows trivially. 
However, the case that F has positive density around the vertices 3^3 requires more work to prove as shown 
below. □ 

Theorem 6.6. Let Xn be a random sample from F such that B{yi,e) C S{F) for some e > and for each 
i — 1,2, 3, then P{Ec,3{n)) 1 as n ^ 00. 



distribution. 




3 



2 



wp ■K2{n) 

wp 7r3(n) = 1 - 7r2(n), 





Figure 9: The figure for Xf.^{n) = Xf.^{n) — {x,y). 
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Proof: Using the transformation (f)(. : {x, y) {u, v), above, we can without loss of generaUty assume Xn is 
a random sample from F with support S{F) C T^. After (/)e is applied, suppose F becomes Fe-, then (f)e{Xn) 
becomes a random sample from F^ such that B{cf>e{Yi), Se) ^ S{Fe) for some Se > and for each i = 1, 2, 3. 
First consider P(Xe-^ = Xe^). Given X^-^ = Xg.^ = {x,y) the remaining n — 1 points will lie in the shaded 
region in Figure [H] (right). X^-^ = X^^ ~ {X^Y) is equivalent to the event that 

Xn C Sr{X, Y) {([/, VK) G : £(yi, ([/, W)) < ^(yi, {X, Y)), ^(ya, ([/, W)) < ^(ys, {X, Y))}. 

The pdf of such {X,Y) is fix,y) = nG{x,y)'^-\f{x,y) where G{u,v) := Pf{X e Sr{u,v)). Note 
that P{Xe2 = Xe^ = yi) = for all n, and = Xg, 7^ yi is equivalent to d{{X,Y), yi) > 0. 

Let e > 0, by Markov's inequality, P(d((X, F), yi) > e) < E F), yi)] /e = E [V^2 + y2j 

Switching to the polar coordinates as X = RcosO and Y ~ Rs'mO, we get \/X^ + Y^ = R. But, 

E [i?] = / / nr G{r^9Y^~^ f{r^9)r dr d9. Integrand is critical at r = 0, since for r > it converges 
Jo Jo 

to zero as n — > 00. So we use the Taylor series expansion around ?■ = as 
fir,9) = f{O,0) + ^^^r + O{r'), 

G(r, 9) ^ G(0, 9) + ^^^r + O (r^) = 1 + ^^^r + O (r^) . 

OG(0 9) 

Note that — - — < 0, since area of Sr (u, v) decreases as r increases for fixed 9. So let r = w/n, then 

or 



7r/3 



1 



ill exp 



'0 Jo 

Hence P{Xe2 = X^^ 7^ yi) ^ as n ^ 00. Then P{Xe2 = ^es) as n — > 00. 

Likewise, it follows that lim„_,oo Pi^ei = ^62) = hm„^oo P{Xei = X^s) = 0. Hence P{Ec;2{n)) ^ as 
n — > 00. Thus P{Ec^3{ti)) ^ 1 as n — > 00. ■ 

Notice that Theorem 16.41 follows as a corollary from Theorem 16.61 For r > 3/2 and A/ e \ 3^3 , 
Fi (<%-„, 7V^^, A/) 7^ a.s., since (A^^b, A/) 7^ and Ms {N^pe, M) C Fi (A",,, 7V^^, M). 

Now, for n > 1, let Xei{n) = x^^ = {ui,Wi) be given for i G {1,2,3}, be the edge extrema in a given 
realization of Xn- Then the functional form of Fi-region in Tf, is given by 



Ti {X„, iV^s, M) = U [Fi {Xn, N^pE, M) n RM{y^)\ 



where 



1=1 



Fi {x^,N^pE,M)r^RM{yl) = { {x,y) e i?A/(yi) ■.y>—~ ''^1;'' 

r (1 — ci) r 

W2 C2 (r (x - 1) + 1 - U2 

r ci r 

w^. - C2 (1 - r) ■ 



Fi (A'„,iV^B,M)ni?M(y2) = ^{x,y) e i?M(yi) 
Fi (A'„,7V];b,M) n i?A/(y3) = {{x.y) e Rniyi) ■■ y < 



See Figure [TOl for Fi (<¥„, iVj.^;, A/) with 71 > 3 where M- vertex regions for A/ = Mcc and A/ = A// with 
orthogonal projections are used. Note that only the edge extrema are shown in Figure fTOl (right). 
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Figure 10: Ti TVj.^;, M) for M = Mcc (left) and M = Mj (right) with three distinct edge extrema. 




Figure 11: Fi (x, 7vXl> ^'^) with x G Rcciyi) (Mt), x e Rfciv^i)- 

Note that, for AVi a random sample from W(T(3^3)), P^qn {^pe) = 3) ^ 1 as n oo, since the edge 
extrema are distinct with probabihty 1 as n ^ oo. However, for r < 3/2, the region Fi {x, Np^, Af)ni?j\/(yi) 
might be empty for some i G {1,2,3}. Furthermore, if M G {^^)° (see Equation ([2]) for ^'') with r < 
3/2, then Fi {Xn, Np^, M) will be empty with probability 1 as n ^ oo. In such a case, there is no Fi- 
region to construct. But the definition of the 77-value still works in the sense that Fi {Xn, Np^, M) = 
= na;GS„ri {x,N^E,M) (see Definition HH for Sm) and Fi {x,N^e,M) ^ for all x € X,, since a; G 
Fi (a;, Np^, M). To determine whether the Fi-region is empty or not, it suffices to check the intersection of 
the Fi-regions of the edge extrema. If M ^ (^'')°, the Fi-region is guaranteed to be nonempty. 

Note that 7y„ (Np^) = rin {N^pe) ^'^^ (''Ij ''2) £ [l,oo)^, where = stands for "equahty in distribution". 

See Figure [T^ for Fi ^A'„, where M-vertex regions for AI = Mcc a-nd AI = AIj with orthogonal 

projections are used. 
Remark 6.7. 

• For ri < r2, Fi {x,Npj^,Ad) C Fi {x,Npj^,A'I) for all x S T (Ja) with equality holding only when 

X G 3^3- 

• For ri < r2, Fi (A'„, TVp^, A/) C Ti Np^j^, AI) with equality holding only when X„ C or 
Fi (A'„,iV^^,Af) = 0forr = ri,r2. 
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Figure 12: Ti \^X,i, N^^, Mj for M = Mcc (left) and M = Mj (right) with three distinct edge extrema. 

• Suppose X,Y are iid from a continuous distribution F whose support is S{F) C T(3^3). Then for 
n < r2, A {T^ (X, iVp^, M)) A {T, [Y, 7V;^^, M)). 

• Suppose <Y„ and arc two random samples from a continuous distribution whose support is 
S{F) C T(3^3). Then for n < r2, A{Ti (A'„, TV^^^, Af )) A(ri (A",;, iV;^, A/)). □ 

Remark 6.8. In R'^ with d > 2, recall 6 (3^m), the simplex based on d + 1 points that do not lie on the same 
hyperplane. Furthermore, let Qi{r,x) be the hyperplane such that Qi{x) n 6 (3^™) 7^ and rd{yi, Qi{r,x)) ~ 
d(yi, T{y„ x)) for i € {1, 2, . . . , d + 1. Then 

Fi (x, N^pE, Mc) n RcM{y^) ^{ze RcM{yr) ■■ d{y,, T(y„ z)) > d{y„ g,{r, x)} for i e {1, 2, 3}. 

Hence Fi (x, N^^^ ^^c) = U^t^i (^i (x, N^e^ Afc)ni?cA/(y»))- Furthermore, it is easy to see that Fi TV^.^,, Mc) 
C\i=i Ti (Xip. (n), A^p^;, A/p), where Xip^{ri) is one of the closest points in Xn to face tpi. □ 

6.2 Central Similarity Proximity Maps 

The other type of triangular proximity map we introduce is the central similarity proximity map. Fur- 
thermore, the relative density of the corresponding PCD will have mathematical tractability. Alas, the 
distribution of the domination number of the associated PCD is still an open problem. 

For T g [0, 1], define N^g{-, M) := N{-, Af; r, 3^3) to be the central similarity proximity map with Af-edge 
regions as follows; see also Figure [13] with M = Mc- For x £ T {y^) \ 3^3, let e(x) be the edge in whose 
region x falls; i.e., x £ i?j\f (e(a:)). If x falls on the boundary of two edge regions, we assign e{x) arbitrarily. 
For T G (0, 1], the parametrized central similarity proximity region N^g(x, M) is defined to be the triangle 
Tt{x) with the following properties: 

(i) Tr{x) has edges e'l{x) parallel to for each i G {1,2,3}, and for a; £ i?7\/(e(a;)), d{x,e'^{x)) = 
T d{x,e{x)) and d{e'^ (x) , e{x)) < d(x,e{x)) where d{x,e{x)) is the Euclidean (perpendicular) distance 
from X to e(x); 

(ii) Tt{x) has the same orientation as and is similar to T (3^3); 

(iii) X is the same type of center of Tr{x) as M is of T (3^3). 

Note that (i) implies the parametrization of the proximity region, (ii) explains "similarity" , and (iii) explains 
"central" in the name, central similarity proximity map. For r = 0, we define Nc^'^(x, M) ~ {x} for all 
x£T (3^3). For X £ d{T (3^3)), we define N^gi^^ ^) = i^} for all r £ [0, 1]. 
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Notice that by definition x € NQg{x, M) for all x £ T (3^3). Furthermore, t < 1 implies that NQg[x, M) C 
T(3^3) for all X € T {y^.) and M e T(3^3)°. For all x e T(3^3)° n i?M(e(x)), the edges e^{x) and e(a;) arc 
coincident iff r = 1. 

Notice that Xi '~ _F, with the additional assumption that the non-degenerate two-dimensional probability 
density function / exists with support S{F) C T(3^3), implies that the special case in the construction of 
^csi') — f^^l^ ^^'^ boundary of two edge regions — occurs with probability zero. Note that for such 
an F, N'^g{X, M) is a triangle a.s. 




Figure 13: Construction of central similarity proximity region. N^,g^^^{x, Mc) (shaded region). 

Central similarity proximity maps are defined with M-edge regions for M E T (y^)" . In general, for 
central similarity proximity regions with Af-edge regions, the similarity ratio of NJjg{x,M) to T [y^) is 
d(x, e^(x))/d{ M, ejx)) See F igure [HI for N^=^^^{x,Mc) with e = 63. The functional form of N^g{x,Mc) 
is provided in ICevhanI ^0^. 



6.2.1 Extension of N^g to Higher Dimensions 

The extension of N^,g to Mf'- for c? > 2 is straightforward, the extension for M = Mc is described, the 
extension for general M is similar. Let J^^+i = {yi>y2, • • ■ 7yd+i} be d -I- 1 points that do not lie on the 
same {d — l)-dimensional hyperplane. Denote the simplex formed by these d -I- 1 points as &{yd+i)- For 
T G (0, 1], define the central similarity proximity map as follows. Let ifi be the face opposite vertex y,; for 
i e {1,2, . . . , (d+l)}, and "face regions" Rcm{'^i), Rcm{'~P2), ■ ■ ■ ,RcM{Vd+i) partition &{yd+i) into d+1 
regions, namely the d -I- 1 polytopes with vertices being the center of mass together with d vertices chosen 
from d+l vertices. For x £ &{yd+i) \ yd+i, let Lp{x) be the face in whose region x falls; x g R{^p{x)). If x 
falls on the boundary of two face regions, ip{x) is assigned arbitrarily. For r S (0, 1], the central similarity 
proximity region N^g{x, Mc) = 6r(a^) is defined to be the simplex 6r(a;) with the following properties: 

(i) &t{x) has faces <Pi{x) parallel to (pi{x) for i e {1, 2, . . . , (d-|-l)}, and for x € RcMifix)), t d{x, (p{x)) = 
d{ip'^{x),x) where d{x,ip{x)) is the Euclidean (perpendicular) distance from x to (p{x)\ 

(ii) Srix) has the same orientation as and similar to &{yd+i)', 

(iii) X is the center of mass of 6r{x), as AIq is of 6(3^d+i)- Note that r > 1 implies that x E N^g{x). 
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6.2.2 Fi-Regions for Central Similarity Proximity Maps 

For N^g, the Fi-region is constructed as follows. Let e^{x) be the edge of Tt{x) parallel to edge for 
i E {1, 2, 3}. Now, suppose u £ Rm{s3) and let Cii^i ^) ^or i E {1, 2, . . . , 7} be the lines such that 



V g Ci(r, u) n Rnies) 

V g C2(t, u) n RAiies) 

V g Csir, u)ni?M(ei) 

V g C4(r, u) n Rm{&i) 



u g e[(i;), 
u g 62(1;), 
M g el{v), 
u g ej(w), 



w g C,^{t,u) n i?A/(e2) 
w g C6('r,M) n i?A/(e2) 
f g C7(t,u) n i?A/(e3) 



u g el{v), 
u g e2(w), 
u g 63(1;). 



(3) 



Then Fi (x, A''^^, M) is the region bounded by these lines. See also FigurefMl Fi (a;, N'^g, M) for x g RM{&i) 
for i g {1, 2} can be described similarly. 

Notice that r > implies that a; g Fi {x,N^g,M). Furthermore, Fi {x,N^g,M) = {x} iff (i) r = or 

{ii) xEdiTiys)). 

The Fi-rcgion Fi {x, Nq^, M) is a convex fc-gon with 3 < fc < 7 vertices. In particular, for r = 1, 
Fi (^x, N^f^^ , M) is a convex hexagon. See Figure [16] (left). 




63 = e(.r) 



Figure 14: Examples of the four types of the Fi-region, Fi ^x, -^(75^^^, -^j'^c^ with four distinct a; g RcAii^s) 
(shaded regions). 



Then the functional form of Qi^) ii^ Equation ^ that define the boundary of Fi (x = (xq, yo), N^,g) with 
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M-ccntral proximity regions in the basic triangle Tt for x G RAiies) is 



Ci(a^) 

C3(X) 



Ceix) 



TO2 (C2 {X - Xo) + yoCl) 



, C2{X) 



m2 (c2 (xo - x) + J/0 (1 - ci)) 



m2 Cl (1 - t) + C2 T mi ' 7712 (1 - Cl) (1 - r) + C2 T (1 - mi) ' 

m2 C2 (x - 1) + 7/0 (m2 (1 - ci) - C2 (1 - m2)) 
Ci(1-Ci)(1-t) 

Cl (m2 (1 - Cl) + C2 (1 - mi)) 7/0 + C2 (^2 (1 - Cl) + C2 (1 - mi)) xq + C2 (r (m2 Ci - mi C2) 

C2 (1 - mi) + m2 (1 - Cl)) a; + C2 T (mi C2 - m2 Cl) ^ C2 (r - Ci (1 - r)) mi 

Cl (1 - Cl) (1 - r) m2 + Cl C2 , 

Tm2 C2a; - 7/0 (ci m2 - C2mi) 7/0 



C2 mi - Cl TO2 (1 - r) 



1 - T 



(1 - Cl) Cl (m2 (1 - Cl) - C2 (1 - mi)) yo + ci C2 (m2 (1 - Ci) - C2 (1 - mi)) xo + (cl Ci (1 - r) mi 
+ C2 T (1 - Cl) + (1 - r) (ci C2 - Cl C2) m2 - C2 T (1 - Cl) - cl Cl (1 -t))x + cIt (nil - ci) / (1 ^ ci) 
(c2 (c2 (1 - r) - r) TOi + Cl (1 - Cl) (1 - r) m2 + ci C2 (1 - 2 t)) 



Proposition 6.9. Let Xn he a random sample from U {T (y^)) . For central similarity proximity maps with 
M-edge regions (by definition M e ^(3^3)°^ and t > 0, (-^(75) < 3 with equality holding with positive 
probability for n > 3. 

Proof: Let M eT (3^3)° and t > 0. Then given X^, for i e {1, 2, 3}, we have 

71 

Ti (A-,,, 7V5s, M)) n RM{e^) = fj [Fi {X^.N^g, M) n i?Af(e.)] 

= Ti (Xe^(n),Ar55,A/) nri (X^, (ti), iVJ^, Af ) n i^A^e.) 

where j,k ^ i, since for x E RM{ei), if {Xek{n)j Xe^in)} C N^,g{x, M), then AV, C N^g{x, AI). Hence 
for each edge we need the edge extrema with respect to the other edges, then the minimum active set is 
Sm = {Xe-^{n), Xe2{'n'), Xe^^n)}, hence ?/„ (N^g) < 3. Furthermore, for the random sample X„, Xe{n) is 
unique for each edge e with probability 1 and there are three distinct edge extrema with positive probability 
(see Theoremi^l). Hence P{r]„ (A^Js) = 3) > for n > 3. ■ 



Note that for r > and a random sample from U{T (3^3)), P{ri{Xn, N^g) = 3) — > 1 as n — > 00, since 
the edge extrema are distinct with probability 1 as tt- — *■ 00. For t = 1, the Fi-region can be determined by 
the edge extrema, in particular Fi (A'„, iVJ^^, M) ni?M(yi) can be determined by the edge extrema Xe^ and 
Xgj, for i,j,k are all distinct. See Figure [15] for an example. Here Fi A^^^^, A/) ^ for all n, because 

by construction Af G Fi A^J^^, A/) since N^^^^iM) = T(3'3). Furthermore, ?/„ (iVJ^) = ?/„ {N^e) 
aU (r,T) G [1,00) X (0,1]. 

Proposition 6.10. The Ti-region Fi (_B, N^"^,^, Af) is a convex hexagon for any set B of size n in T (3^3)°. 

Proof: This follows from the fact that each Ci{x) for i G {1, 2, . . . , 6} is parallel to a line joining to M for 
T = 1 iCrix) is not used in construction of Fi (S, N^'^^, M^). See Figure [TBI (right) for an example. ■ 

With M = Mc, the functional forms of the Hnes that determine the boundary of Fi (a; = (a;o, J/o), -^cs) 
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Figure 15: G{AB) region for xi and Fi-region for n > 1 



in Equation ^ for Th with x G Rcm{gz) are given by 

VQ Cl + C2 {X - Xo) C2 (X'o - + (1 - Cl) 

Cl + T T + 1 - Cl 

^ , ^ C2 r (1 - a;) + j/o , / n t C2 (1 - x) + C2 (xq - x) - yo ci 

(3(2;) = — — — J- ^, C4(a;) = -j ^ , 

C2X + yQ yo 

Csix) = — , C7(a;) - 



1 + T Cl 1 — r 

Cl (1 - ci)yo + Cl C2 a:o - C2 (2t (1 - Cl) + Ci (1 - t)) a; + r C2 (1 - 2ci) 

ci(l-ci)(l-r) 



For n > 1 with M ~ Mc, let X^^ (n) = Xg^ = {ui, Wi) be given for i G {1, 2, 3}. Then the functional forms 
of the lines that determine the boundary of Fi {XnjN^g, Mc) are given by 

W2 Cl + C2 (x - U2) C2 r (1 - x) + yo 

Cl a;) : , Cs a;) = — n ^, 

Cl + r 1 + T (1 - Cl) 

... C2 (Ul - x) + Wl (1 - Cl) C2 X- + UI3 

C2[x) = — , Cs a-') = TT ' 

r + 1 — Cl 1 + rci 

> , X T C2 (1 - x) + C2 (w3 - a;) - w;3 Cl /• / ^ _ "^a 

(,4(X) — — 7 , (,7[X) — 



Ceix) 



t(1-ci)-ci ' 1-r' 

Cl (1 — Cl) Wl + Cl C2 Ui — C2 (2 r (1 — Cl) + Cl (1 — t)) X + T C2 (1 - 2 Cl) 

ci(l-ci)(l-r) • 



See Figure[T6]for Fi (x, TV^I^, Afc) and Fi {Xn, N^'^\Mc). 
Remark 6.11. 

• For Ti < T2, Fi {x.Nllg^M) C Fi {x,N^g,M) for all x G ^(ya) with equality holding only when 
x(.d{T{y^)). 

• Let be a set of n points in T{yj,). Then for n < T2, Fi TV,!,'^, M) C Fi TV,!,"^, M) with 
equality holding only when C 9(T (3^3)). 

• Suppose X^Y are iid from a continuous distribution F whose support is S{F) C T(y^). Then for 
n < r2, (Fi (X, 7VJ^5, M)) A (Fi (F, iVj^^, M)). 
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Figure 16: The Ti-region Ti {x, N^g,Mc) with x G i?M(e2) (left) and Ti (X„, iVj^, Mc) with n > 1 (right). 



• Suppose Xn and A"^ are two random samples from a continuous distribution F whose support is 
S{F) CTiy^). Then for n < r2, ^(Fi (^„, TV^^^, M)) <^^^ A(ri (A"^, A^J^^, M)). □ 

7 Investigation of the Proximity Regions and the Associated PCDs 
Using the AuxiUary Tools 

7.1 Characterization of Proximity Maps Using 77- Values 

By definition, it is trivial to show that that the minimum number of points to describe the Fi-rcgion. 
Vn{N) < n for any proximity map. We have improved the upper bound for Np^ and N^g: rj{Xm ^pe) — 
and rj{Xn, N^g) < 3. However, finding such an improvement does not hold for rjn^NAs)] that is, finding a 
k < n such that r]n{NAs) l£ k for all Xn is still an open problem. 

Below we state a condition for N{-,A'I) defined with A/- vertex regions to have 7]n{N) < 3 for Xn with 
support in T (3^3 ) . 

Theorem 7.1. Suppose N(-^M) is a proximity region defined with M-vertex regions and B is a set of n 
distinct points in T (3^3). Then rj{B, N) < 3 if 

(i) for each G 3^3 there exists a point x{yi) G B (i.e., related to yi) such that Fi(i?,iV) n i?A/(yi) = 



(a) there exist points x{yj), x{yk) G B such thatTi{B,N)r\RM{yi) ~ Fi {x{y j) , N)r\T i (a;(yfc), iV)ni?Af (yO 
for j,k ^ i with i G {1,2,3} and {j,k) G {(1, 2), (1, 3), (2, 3)}. 

Proof: Let B = {xi,X2, . . .,Xn} C Tiy^). 

(i) Suppose there exists a point x{yi) G B such that Fi(i3, A^) n RM{yi) — Fi {x{yi), N) n RM{yi) for each 
i G {1,2,3}. Then 



Fi {x{y,),N)nRM{y^), 



or 



Fi(B, N) n RM{y^) - Ti {x{y^),N) n RM{y^) = 



n 3 



3 



f|[Fi (x„Ar)ni?M(y,)] = f][ri {x{yj), N) n Rm iy^)] 



f|Fi(x(yj),iV) ni?M(y,) 
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and 



TiiB,N) = |J[ri(B,7V)ni?M(yO] = IJ 



f|ri ixiy,),N) 



ni?A/(yi) 



Then, we get 



Hence, the minimum active set Sm C {a:(yi), a::(y2), a;(y3)}, which imphes r]{B,N) < 3. The 7;-value 
77(5, N) < 3 will hold if x{yi) arc not all distinct. 

(ii) Suppose there exist points x{yj) and x{yk) such that Fi {B, iV)ni?A/(yj) = Ti {x{yj), A^)nri {x{yk), N)r] 
Rniyi) for j,k ^i. Then 



Ti [B, N) n i?A/(yO = Ti (a:(y,), TV) n Ti (a;(yfe), TV) n i?M(y,) = 

n 3 

fl [Fi (a;., iV) n i?A/(yO] = f) [Ti (x(y,), iV) n i?A/(y.)] - 

i=l q=l 



fjri (x(y,),TV) 

.9=1 



ni?A/(yO 



and 



TiiB,N) = U[ri(i3,7V)ni?A/(y,)] = IJ 



i=l \ lq=l 



fjri ix{y,),N) 



ni?A/(yO 



Then, we get Ti{B, N) = Hi^i ix{yi), N) . Hence, the minimum active set Sm C {^(yi), x(y2)i a;(y3)} 
which implies 'q{B, N) < 3. ■ 

Notice that Np^ satisfies condition (i) Theorem 17. II 

Below we state some conditions for N{-,M) defined with M-edge regions to have 77- value less than equal 
to 3. 

Theorem 7.2. Suppose N{-, M) is a proximity region defined with M -edge regions and B is set of n distinct 
points in T (iVs). Then ri{B, N) < 3 if 

(i) for each Ci G {ei, 62, 63}, there exists a point x{ei) G B such that Ti{B, N) D RAiici) = Ti {x(ei), N) D 
RAiiei), 



(ii) there exist points x{ej), x{ek) S B such thatTi{B, N)r]R]\i{ei) = Fi {x{ej), N)nTi (x{ek), N)r]R]\i{ei) 
for j,k^i with i G {1, 2, 3} and (j, k) G {(1, 2), (1, 3), (2, 3)}. 

Proof: Let B ^ {xi,X2, . . . ,Xn} C T (y^). 

(i) Suppose there exists a point x{ei) G B such that Ti{B, N) n RM{ei) = Fi (^(6;), N) n RMi^i) for each 
i G {1,2,3}. Then 



Fi(B, N) n i?A/(e,) = Fl {x{ei), N) n i?A/(e,) = 

n 3 

f|[Fi (x„7V)ni?A/(e,)] = nt^i {x{e,),N)C^RM{ei)\ 



1=1 



fjri {x{e,),N) 



ni?A/(e,) 



and 



Fl (B, iV) = U [Fl (B, TV) n Rm [ei)] = |J 



fjri {x{e,),N) 



ni?A/(ei) 
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Then, we get 

3 

Ti {Xn.N) ={~]Ti {x{e,),N). 

Hence, the minimum active set Sm C {a;(ei), a:(e2), ^(es)} which imphes •qn{N) < 3. 

(ii) Suppose there exist points x{e.j) and x{ek) such that Fi N)r\RM{ei) = Fi {x{ej), iV)nFi (a;(efc), A'')n 
RhiiGi) for j,k ^ i. Then 

Fi [B, N) n i?M(e,) = Fi {x{ej), N) n Fi (x(efc), TV) n i^Afle,) = 

" 3 

n RM{ei) 



f|[ri (a;„7V)ni?M(e,)] = fj [Fi (x(e,), TV) n EM(e.)] 

1=1 q=l 



fjri (x(e,),7V) 



and 



Ti{B,N) = \J[Tl{B,N)f^RM{e^)] = |J 

i=l i=l 



fjri (x(e,),iV) 

.9=1 



ni?M(e,) 



Then, we get Fi(i3, A^) = Fi (a;(ei), A^) . Hence, the minimum active set S* a/ C {x(ei), x(e2), a:(e3)} 
which imphes r]{B^ N) < 3. ■ 



Notice that N'[;g satisfies condition (ii) in Theorem 17.21 

7.2 The Behavior of Ti N) for the Proximity Maps in T (3^3) 

In Sectional we have investigated the behavior of Fi {Xn, N) for general proximity maps in fi. The assertions 
made about Fi-regions wih be stronger for the proximity regions we have defined, i.e., for Nas, ^pe^ ^ii*i 
N^g, compared to the general assertions in Sectional One property enjoyed by these proximity maps is that 
the region N{x) gets larger as x moves along a line from d{T (y^)) to ^s(iV) in a region with positive M^- 
Lebesgue measure. So the modifications of the assertions in Section U also hold for {Ns, Nas, Np^, N^g}. 
In particular, we have a stronger result than the one in Proposition [O] in the sense that, ,'%s{N) is a proper 
subset of Fi {Xn, N) as shown below. 

Proposition 7.3. For each type of proximity map N G {Ns, Nas, Np^, ^cs} ^^'^ '^''^V fo.ndom sample Xn = 
{Xi,X2,...,Xn} from a continuous distribution F on r(3^3), if ^s{N) ^ 0, then ^s{N) C Fi(A'„,iV) 
a.s. for each n < 00. 

Proof: We have shown that .^s(A^) C Fj {X„,N) (sec Proposition O]) . Moreover, Ms{N) ^ for Nas, 
Npp, with r > 3/2, and N^f^^. For these proximity regions, Ms{N) = Fi {Xn, N) with probability for each 
finite n since 

(i) for N £ {Ns, Nas}, Ti {X„, N) = S^s (N) iff Xy(ri) = y for each y G 3^3 which happens with probability 
0, 

(ii) for N e {iV5^\ Np^^^^Y Ti{X„,N,M) = ^s{N,M) iff Xe,(n) G e, for each i e {1,2,3} which 
happens with probability 0. 

Furthermore, for N^j^ with r < 3/2, ^s{N^e,M) 7^ iff Af ^ (^'■)° (see Equation © for ^'■), say 
M = im^,my) is such that d{C{m^,y2),y2) < d(y2,e2)/r. Then Ti [x^, N^pe, M) = ^s{N'pe,M) iff 
Xe2{n) G 62 which happens with probability 0. Similarly the same result also holds for edges Ci and 63. I 

Note that, if Ms{N) ~ and Xn is a random sample from a continuous distribution on T{y-^), then 
Fi {Xn,N) = a.s. as n 00. In particular, this holds for Np^i', M) with r < 3/2 and M G {3^'')°. Lemma 
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l4.6l holds as stated. In Lemma we have shown that Fi {X{n), N) is non-increasing. Furthermore, for the 
proximity regions {Ns, Nas, ^pe^ -^csl; '^^ '^^^ state that Fi {X{n + 1), TV) C Fi {X{n), N) with positive 
probabihty, since the new point in X{n + 1) has positive probabihty to faU closer to the subset of T (y^) 
that defines ^siN) (e.g., d{T (3^3))). The general results in Theorems and [Hn] and Proposition SH hold 
for the proximity maps {Ns, Nas, ^pei ^cs\ ^Iso. In particular we have the following corollaries to these 
results. 

Corollary to Theorem Given a sequence of random variables Xi, X2, . . .''^ IA{T (3^3)), let X{n) :~ 

X{n-l)U{Xr,} forn = 0,1,2,.. . withX{0) 0. Then for each N € {Ns, Nas, N^j^, N^g}, Ti {X{n),N) j 
^siN) as 00 a.s., in the sense thatTi {X{n + l),N) C Fi {X{n),N) and A{Ti {X{n),N)\Ms(N)) i 
a.s. 

Corollary to Proposition \4-S\ For positive integers m > n, let Xm and Xn be two random samples 
from U{T (3^3)). Then A (Fi {X^,N, M)) <^'^ A (Fi (X^, N, M)) for proximity maps N G {N'pE,Nl;g}, for 
r e [1,00) andr e (0,1]. 

For N E {Ns, Nas}, the stochastic ordering of A{Ti (Xn, N, M)) is still an open problem, although 
we conjecture that A{Ti {Xm, N, M)) <^'^ A(ri {Xn, N, M)) for Xm and Xn two random samples from 
U{T{y3)) with m>n. 

Corollary to Theorem \4-10[ Let {Xn}^^i be a sequence of data sets which are iid U{T [y^)). Then 
Fi {Xn,N) ^ Ms{N) asn-.^forNe {Ns, Nas, N^pe^ N^cs)- 



7.3 Expected Measure of Fi-Regions 

Let A(-) be the R'^-Lebesgue measure on R'' with d>l. In M, A(-) is the length | • |, in M^, A(-) is the area 
A{-), and in W^, A(-) is the d-dimcnsional volume V{-). In M with 3^2 — {0, 1}, let Xn be a random sample 
from IA{Q, 1), and Ns{x) = B{x, r{x)) where r{x) = min(x, 1 — x). Then, 

Fi {Xn,Ns) = [Xn:nl2, (1 + Xi^„)/2] ^ A(Fi (A'„, Ns)) = |Fi {Xn, Ns) I = (1 + - Xn:n) /2. 

Hence the expected length of the Fi-region is 



¥.[\{Ti{Xn,Ns)] = E 



1 + Ni-n — Xn:n 



2 



1 + E - E [Xn:r^ 

2 



1 + — — 1 

' n+1 n+1 ^ n 

= ■ ■ — = > as n ^ OO. 

2 n + 1 

In M^, with three non-coUinear points 3^3 = {yi,y2,y3}, let Xn be a random sample from U{T{y3)). Then 
^(Fi [Xn, N^j^,M)) > a.s. for all ?i< 00, r > 3/2, M e \ 3^3. Furthermore, A{Ti {Xn, iV(5|i, M)) > 
a.s. for aU n < 00 and M G (T (3^3))°; and for N e {Ns, Nas} A{ri {Xn,N, M)) > a.s. for all n < 00 and 
MgR2\3?3. 

The Fi-region, Fi (<%"„, TV), is closely related to the distribution of the domination number of the PCD 
associated with TV. Hence we study the asymptotic behavior of the expected area E [A(Fi {Xn, N, M)], as 
n ^ 00, for TV e {TV^s, TV^^, TV^^}. 

In R, TV5 and TV^5 are equivalent functions, with the extension that TV^5 is defined as TV5 for X points 
outside (yi:m,ym:m)- In higher dimensions, determining the areas of Fi (A'„, TV, TV/) for for TV e {TV5,TV^5} 
and for general Xn and hence finding its expected areas are both open problems. 



7.3.1 The Limit of Expected Area of Fi (<%"„, TV, M) for TV^^ and N^g 

Recall that for TV G {TVp^, TV^^}, Fi (<%"„, TV, A/) is determined by the (closest) edge extrema Xe{n) € 
argminj(-g_Y„ c'(-'^7 e), for e G {61,62,63}. So, to find the expected area of Fi {Xn, N, M), we need to find 
the expected locus of Xe{n); i.e., the expected distance of d{Xe{n) from e. For example, for Xn a random 
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sample from a continuous distribution F, argminJ(-g;^-^ d{X,e) is unique a.s., and if d{Xe{n), e) = u, then 
Xe{n) falls on a line parallel to e whose distance from e is u a.s. 

Lemma 7.4. Let Di{n) := (n), e,) /or i e {1, 2, 3} anrf A",! be a random sample from IA{T (3^3)). Then 

E [Di{n)\ — !■ (i.e., the expected locus oj Xe^{n) is on Ci) for each i G {1, 2, 3}, as n ^ 00. 

Proof: Given Zi = {Xi,Yi) U{T [y^)). Then for e = 63, D^in) = Yi-n (the minimum y-coordinate of 
Zi e Xn). First observe that <y) = ^ ''^2 — hence 

i^y(2/) = ^^^^^^1(0 < 2/ < C2) + I(y > C2). 
So the pdf of Y, is /y(y) = 2 ^^-^1(0 < y < C2). Then the pdf of Yi-^ is 

/i:„(y) = 2 n(c2 - y) (^1 - ^^^^^) C2-2l(0 < y < C2). 

Therefore, 

E [yi:„] = / '2yn(c2-y) f 1 - ^ ""2^ ) C2-2dy = 0, as n ^ 00. 

Hence E [Yi:„] = E [Asl"-)] ^ 0. Similarly, E [D,{n)] for i € {1, 2}, as 71 -> 00. ■ 

Theorem 7.5. Let X„ be a random sample from U{T (ys)) and M G T {y-sY . For N G {iV^^;, N^s}' 
E [A(ri {Xn,N, M))] -> A{Ms{N, M)) as 00. 

Proof: RecaU that for N G {N'pj^, N^s}^ (A-^, TV, M) = flLi (^e. (n), A^, M)- Moreover, Ti (A'„, iV, A/) 
^s(iV, M) iff (ri) G Ci for i G {1, 2, 3}. In Lemma [7.41 we have shown that expected locus of Xe{n) con- 
verges to edge e as n — * 00. Hence the expected locus of 5(ri (A'„,A^, M)) n RM{ei) converges to the 
a(^s(7V, M)) n i?M(e») for each i G {1, 2, 3}. Hence 

E [A{ri {Xn, N, M)] A{^siN, M)) as n ^ 00. ■ 

Remark 7.6. In particular, 

i- E [A (Li {Xn, NIe, Mc))] 1/4 as n ^ 00, since (A^pb, Mc) = ^(Mi, Af2, M3). 

ii- E [A (Li [Xn, N^E, A/))] ^ as n ^ 00 if A/ G ^^ since {N^e^ ^'^) ^ ^r M G 



iii- Furthermore, E 



A (Fi iV^/J,A/c))] ^ since (iV^/J,Mc) = {A/c}. 

iv- For any M G T (3^3)°, E [A (Fi (A'„, iVJ^, A/)))] ^ as n ^ cx), since (Af,?;^, Af) = {Af}. 
V- We also have E [A (Fi iV^^, Mc))] for r G [1, 3/2] as n ^ 00. 

vi- Furthermore, by careful geometric calculations, we get E [A (Fi {Xn, NpE, Mc))] V3 [1 — 3/(2 r)]^ 
for r G (3/2,2]. 

vii- E [A(Ti (A'„,iV^^,Afc))] ^ V3/ [4(1 - S/r^)] for r G {2, 00], as n -> 00. 
We also derive the rate of convergence of E [A{Ti {Xn, NpE, Mc))] for r = 3/2. 
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Theorem 7.7. Let Xn he a random sample from U{T (y^)). For r = 3/2, the expected area of the the 
Ti-region, E [A (Fi {Xn, Np^, Ale))], converges to zero, at rate O 



Proof: For r = 3/2 and M = Mc, and sufficiently large n, Fi (Xe- ,-/Vp^) n RcAiiVi) is a triangle for 
z = 1,2,3 w.p. 1. See Figure [T71 With the realization of the edge extrema denoted as = {xi,yi) close 
enough to e,;, for z = 1, 2, 3, 



Fi ^Xei,-/Vp'^^ n RcAiiyi) is the triangle with vertices 



+ (-9 + 2V3,. + 6..)),(^i f (-^ + 2V3,. + 6 

Fi (^Xe2, Np^^^ n RcAiiyi) is the triangle with vertices 



X2 



1 \/3 
2' Is" 



1 ^/3 

2 ~ ~3" 



(3 + 4 \/3 1^3-12x3)^ , 1^ 
and Fi ^Xej, Np^^^ D RcAiiys) is the triangle with vertices 



73 
' 18 V 



y3 + X3,-— (-3 + 2\/3y3-6.T3 



1 Vs] 

2' 6 ] ' 



1 



ya =*(l/2, ^/2) 




yi = (0,0) y2 = (l,0) 

Figure 17: The shaded regions arc the triangular F!^^^(XeJ n RcMiVi) regions for i ~ 1,2, 3. 
Then for sufficiently large n, 

A (Fi [Xn,N'JlM)) = ll (3x2 - 3 + V3y,y + ^ (-3x3 + VSya)' + 



V3 



(Sxl ~ 6x2 + 2V3y2 X2 - 2V3y2 + yl + 3 + yi - 2V3y3X3 + 3x1 + 4: 
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y,^ (1/2, v/3/2) 




yi = (iJ. iiJ y^Hi.o) 
Figure 18: The figure for the joint pdf of X^^- The shaded region is r(C)- 



To find the expected area, we need the joint density of the Xg. . The edge extrema are ah distinct with 
probability 1 as n ^ oo (see Theorem [63]) . Let T(C) be the triangle formed by the lines at parallel to Ci 
for i = 1, 2, 3 where C = [xi, yi, X2, 2/2, 2:3, j/3). See Figure flSl 

Then the asymptotically accurate joint pdf of Xe- (jCevhan and Priebd ()2007f )') IS 



n{n- l)(7i-2) 



n— 3 



n{n - l){n - 2) ( \/3/36 f-2 VSyi + Vly^i - 3x3 + VSys + 3x2 



/3(C) 



with the support Ds = {C G 3^^^ : {xi^yiYs are distinct}. 
Then for sufficiently large n. 



n— 3 



V3/4 



E 



A Fi A-,,, TV 



.3/2 



A/ 



a(Fi (A'„,7V^/j))/3(C)dC 



^(F.(^„,<i)).(.-l)(.-2)(^ 



Let G(C) = A{T{())/A{T (3^3)). Notice that the integrand is critical when £ for i = 1, 2, 3, since G{C) 
when Xsi & Ri for each i = 1, 2, 3. So we make the change of variables yi — zi, 2/2 ~ \/3 (1 — X2) — Z2, and 
V3 2:3 — Z3, then G(C) and A(Fi (A'„,iV)) becomes 

G(C) = G(zi, Z2, 23) = (2 zi + Z3 - \/3 + Z2) ' /3 and A(ri (A",,, iV)) VS (z| + + 4 zf) /9, 

respectively. Hence the integrand does not depend on xi,X2,X3 and integrating with respect to xi,X2,X3 
yields a constant K. Now, the integrand is critical at {zi,Z2,Z3) — (0,0,0), since G(0,0, 0) = 1. So 
let be the event that < < e for i = 1, 2, 3 for e > small enough. Then making the change 

of variables Zi ~ Wi/n for i ~ 1,2,3, we get A ^Fi (^'^n, Np^^^^ ~ O and G(zi, 22,^3) becomes 
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G{wi,W2,W3) = 1 - ^ {2/V3{2wi +W2 + W3)) + O (n"^), hence 

a(t, (a-^^AT^/J^^ n{n - l){n - 2)^ G{wi,W2,W3r-^dwidw2dw3, 



rne pne pne 

K 



/O "'0 "'0 

letting n 00 

/•OO />00 /"OO 



/•OO />00 /"OO 

since /(,°° exp (— 2/\/3 (2 wi + W2 + W3)) (iwi(iw2t^W3 = 3-^3/16 which is a finite constant. Hence 
A {Vi {Xn, TVp/J, AlYj ^ as n ^ 00 at the rate O [n''^). ■ 



E 



8 Ffc-Regions for Proximity Maps in T (3^3) 

We can also define the regions associated with jn{N) = fc for fc < n. 

In M with 3^2 = {0,1}, 7n(-^) < 2, hence we can only define r2-regions. Recall that jn{N) = 2 iff 
n i±fi-] = iff C [0, 1] \ Ti TV). So 

r2(A'„,iV) = {(x,2;)e[0,l]2: A'„CiV(x)UiV(y); a;,y^ri(A'„,iV)} 
= e [0,l]2\ri(A'„,7V)' : X, C N (x) U N (y)}. 

Notice that r2 (A'„, N) C [0, 1]^. Let 

xi := argmin^g;t,^n(o,i/2)(l/2 - a;) and X2 := argmin^g;t,^n(i/2a) (a^ - 1/2), 

then 7n(-^) = 2 iff xi, 2:2 ^ Ti ^)- In such a case Xn C A^(a;i) U N{x2) by construction. 
In general, 



Definition 8.1. The r2 -region for proximity map 7Vj;(-) and set B C VLisT2{B,N) {{x,y) e [f7\ri(B)] 
-B C Ny[x) U -/V;y(y)}. In general, Tk-region for proximity map Ar3;(-) and set i? C for fc = 1, 2, . . . , n is 



2 . 



Tk{B, N) = < (xi, a::2, . . . , Xk) G fl*^ ; -B C [J A'^y (xi) and all possible m-permutations {ui, U2, . . . , u„i) 
[ i=i 

of (xi, X2, . . . , Xfc) satisfy (ui, M2, . . . , Um) ^ r,„(i?, iV) for each m = l,2,...,fc— ij^.D 

Note that Ffc-rcgions are defined for fc < n and they might be empty. Moreover, Ffc-regions are in fl'^, 
not in Q. 

Let I I denote the Stirling partition number for a set of size n into m blocks and let | | denote all 
Stirling partitions of a set A into m blocks; that is. 

In particular, | | is the unordered pair of blocks Bi and B2 such that Bi ^ $ and Bi C AVi for i = 1, 2, 
and BiUB2 = Xr,. Note that B2 = X,, \ Bi. Then 
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Proposition 8.2. T2 {Xn,N) = U{ei Balej-^a" l^^^ ('^i.^) x Ti {B2,N)] \ Ti (A'„,iV)^ /or any nonempty 
Stirling blocks Bi and B2 m | | . 

Proof: Given Xn, suppose {u,v) e r2 A^), then A'„ C iV(u) U iV(u) and u,v ^ ri(A'„,Af). Let 
Si = n N{u), and = n N{v)] \ N{u). Then Bi and S2 are two Stirling blocks in | |. Hence 
Bi C A^(w) ^ueVi {Bi,N)\Ti {Xn,N) and C N{v) ^veVi {B2,N)\Vi TV), hence T2 iV) C 
U{Bi,g2}G{-*,"}[ri {Bi,N) X Ti (S2,Af)] \ (Fi (A'„,iV))2. The other direction is trivial, hence the desired 
result follows. ■ 



In M with = Ns, we can exploit the natural ordering available. 

n-l 

Proposition 8.3. In M witli 3^2 = {0, 1}, T2 {Xn,Ns) = |J 



fe=i 



Proof: Recall that T2 {Xn.Ns) = U{ei,e.}e{ } Fi {Bi.Ns) x Ti iV^)] \ Ti (A-^, iVs)'- Let {Si, S2} be 

a Stirling partition in | "^'^ |. First observe that Bi = {xi:„, . . . , xu-n] and B2 = {a;(fc+i):rii ■ • ■ j Xn-.n] forms a 
Stirling partition of AVi, and 



Tl ({Xl:„, . . . , Xk:n}, N s) \ Tl (A'„, iVg) 



\ 



2 ' 2 



Xk:n Xji-ji 



and 



Fx ({x(fc+i) , . . . , Xn-.n], A^s) \ri (A'„, TVs) 



Furthermore, 



Xn:n 1 + ^^(fc+l) 



2 ' 2 



l + a:i:n l + ^(fc+i) 



where 



Fi({xi:„,...,Xfc,„},iVs)xFi({x(fc+i),...,a;„:„},7Vs) D |J Fi (Si, iV^) x Fi (S2, iVs) (4) 

{Bi,B2}eA 



A = |{Si,S2} ^ 1"^" I • 2''l:n e Bi, Xn;n G S2, max(Si) = Xk:n, miu (S2) = / < fc 

l + xi:„ l + a;(fc_i) 



Xk:n Xriin 



Hence the desired result 



since the left hand side in Equation ^ is 
follows. ■ 

Similarly, for Np^, 

F2 A^pb) - U [riiBi,N'pE,M)xTi{B2,N'pE,M)]\ri{Xn,N^pE,Mf . 
{ei,e2}e{-^2"} 

Note that Fi (Si, A^p^, M) is determined by the edge extrema in Bi, i — 1,2. Furthermore, if {u,v) € 
F2 iXn,N^j^, M), then (u, v) ^ i^M(y)^ since either N^e{u) C N^^^iv) or N^si^) Q N^si^) should hold if 

{u,v) e RMiyf- 

For N-cs, 

F2 (-%-„, A^5s, M) = U [Fi (Si, ATJs, M) X Fi (S2, A^^s, M)] \ Fi (A'„, TVJ^, uf . 
{ei,B2}6{'^2"} 
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Note that Ti (S^, N^jg, M) is determined by the edge extrema of B^, i = 1, 2. But if (w, v) G r2 N^g, M), 
then (u,f) G Rm{s)^ can hold. In these proximity regions, not all edge extrema should fall in the same 
partition set for Ffc [X^ N^g, M) to be nonempty. 

For any proximity map N , 



P{jn{N) = 2) = P{X^^ n r2 {X,„N) ^ 7„(iV) ^ 1) 

/ r 

■^nfl U [Ti{B^,N)-KT,{B2,N)]\{T,{Xr,,N)f 

\ [{ei,e2}e{'^2"} 



P 



^0, A'„nri (A'„,iv) 



A more compact way to write this is as 

p{in{N) > 2) - p {xl n r<2 [Xn, N) = 0) 

where r<2 {X„,N) := U{ei,e.}e{ } (Si,Af) x T^ (62, A^). 
Furthermore, for k > 3, the Ffc-regions are defined similarly as 

rk{X„,N)= y [Fi(6i,iV) X ... X Fi(6fc,A)]\Fi(A'„,A)* 

{Bi,...A}e{-^"} 



Hence, 



P(7„(A^) = fc) = P{X^nrk{Xn,N)^(d,j,,{N)> k^l) 

p I A-^ n 



U Fi(6i,A)...xFi(6fc,A)\Fi(A'„,iV)' 
{Bi...,B.}e{'L"} 



A more compact way to write this is as 

PilniN) >k) = P {X^ n F<fe (A-,,, Af) ^ 0) 
where T<k{Xn,N) ■=V]{b,,...,b,}<^{\^}^^ (■^i'^) x ••• x ri(6fe,A^). 

9 Values for the Proximity Maps in T (3^3) 

Recall that the domination number, 7n(A^) is the cardinality of a minimum dominating set of the PCD based 
on A^. So by definition, jniN) < n. We will seek an a.s. least upper bound for ^n{N) which suggests the 
following concept. 

Definition 9.1. Let Xn be a random sample from F on r(3^3) and let 7n(A^) be the domination number 
for the PCD based on a proximity map A^. The general a.s. least upper bound for 7ti(A^) that works for all 
n > 1 is called the n-value; i.e., Kn{N) := min{fc(n) : 7 {Xn,N) < k{n) a.s. for all n > 1}. □ 

It is more desirable to have a k- value that is independent of n. Further, if k„(A^) = k, exists for A^ and 
is independent of n, then the domination number has the following discrete probability mass function: 



7 AT) = <^ 



1 w.p. pi, 

2 w.p. P2, 



w.p. p^. 
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Figure 19: The figure for k„ {N^g) = n. 

In R witli 3^2 = {0,1}, for a random sample from U{Q,1), 7n(-^) < 2 with cquahty holding with 
positive probability for N e {Afs,^As}- Hence KniNs) = 2. But in R"* with d > 1, finding k„(7V) for 
N G {iVs, Nas} is an open problem. Next, we investigate the K-values for {Nasi ^pe^ -^cs) 

Theorem 9.2. LetXn be a random sample from U{T (y 3)), and M eR'^XVa. For N^j^{-, M), (N^e) = 3. 

Proof: For A^p^(-,Af), pick the point closest to edge in vertex region i?j\/(yi); that is, pick Ui G 

aTgmmxex„nRM(yi) ^(^' = ^'^&'^^^xex„nRM{y.) ^(^(V' V^) ™ vertex region for which A'„ni?A/(yi) 7^ 
for i e {1, 2, 3} (note that as n — > 00, n i?M(yi) 7^ for all i £ {1, 2, 3} a.s., and also Ui is unique a.s. 
for each i, since X is from Z^(r (3^3))). Then n i?M(y») C N^'pj^iU,, M). Hence C ULi ^pe{U,, M). 
So 7„ {Np-^, Mc) < 3 with equality holding with positive probability. Thus k„ (Np^) = 3. ■ 

There is no least upper bound for 7ri(-^c'S' that works for all n > as shown below. 

Theorem 9.3. Let X„ be random sample from 'U{T {y^i}). Then 7„(iVJg,Af) ^ n holds with positive 
probability for all t G [0, 1] . 

Proof: For t = 0, the result follows trivially. For r = 1, we will prove the theorem by showing that there 
is a union of n regions of positive area in T (3^3), so that u G Nq^.^{v, M) iff u = w, for any u,v G X„. Let 
M = (mi,TO2) € r(3^3)°. In i?A/(e3) locate n triangles evenly on 63 with base length 1/n and similar to 
2^(3^3) (with similarity ratio 1/n). See also Figure [TOl Then locate n points in each triangle at Zi = {xi,yi) 
such that {xi,yi) is the same type of center of Ti as M is of 2^(3^3). Then using the similarity ratio 
of NQ^^{zi,M) to T(3^3), namely, yi/m2 = 1/n, we get yi = m2/n for all i = 1,2, ...,n. Moreover, 
Xi — Xi-i = 1/n for i = 2, 3, . . . , n with xi = mi/n and a;„ = 1 — (1— mi)/n. Then (xi, yi) G N'/f^^((xj, yj), M) 
iff I = j. Furthermore, for sufficiently small £ > 0, the same holds for the e neighborhood of each Zi = (x^, yi). 
That is, A*"p^^(.T, M)nB{zj,e) = for all x G B{zi,e) for any distinct pair i, j G {1, 2, . . . , n}, and probability 
of Xn being composed of n points one from each B{zi,e) is positive. Then 7„(A^p^,M) = n holds with 
positive probability. The result for r G (0, 1) follows similarly. ■ 
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9.1 Characterization of Proximity Maps Using k- Values 

Notice that for the proxhiiity maps we have considered, k„ (Np^) = 3 and k„ (N^g) = n. One property of 
proximity maps that makes k„(A^) < n is that probabihty of having an for which N{X) D = {X} for 
all X G Xn is zero. 

Below we state a condition for Kn{N{-, A/)) < 3 for N{-, M) defined with M- vertex regions. 

Theorem 9.4. Suppose N(-, M) is defined with M -vertex regions with M G \3^3 and N{x, M) gets larger 
as d{£{y,x),y) increases for x e i?7\/(y) in the sense that N{x,M) C N{z,M) for all x,z € i?M(y) when 
d{i{y,x),y) < d(£(y,z),y). Then k„{N) < 3. 

Proof: When Xn n i?Af(yi) 7^ 0, pick one of the points Ui{n) G argmaxJ^g;^,^p^^^(y.■) (i(£(yi, X), y.j), then 
A'„ n i?Af(yj) C N{U,{n)) for each i G {1, 2, 3}. So 7 (A'„, TV, M) < 3, and hence"K„fiV) < 3. ■ 

Notice that Np^ satisfies the conditions of Theorem 19.11 

Theorem 9.5. Suppose iV(-, M) is defined with M-edge regions and N{x, M) gets larger as d{x, e) increases 
for X e Rm{£) in the sense that N{x,M) C N{y,M) for all x,y G Rm{g) when d(x,e) < d{y,e). Then 
K„{N) < 3. 

Proof: When Xn n RM{ei) 7^ 0, pick one of the points Uei{n) £ a.Tgma.Xx^x,^r]RM{ei) '^(■^^^i)- Then 
Xn n RM{e,) C N{Ue{n)) for each i G {1, 2, 3}. So 7 (A'„, TV, M) < 3, and hence K„(iV) 3. ■ 

In Theorems 19.11 and 19. 1[ we have an upper bound for k„(A^). To determine an exact value for k„(A^) 
we need further restrictions. Let Ai := {x G T (y^) : n{N{x,M)) = ^J-iT (y^))} and A2 := {{x,y) G 
T (3^3) X T (3^3) : iiiN{x, M) U N{y, M)) = fi{T (3^3))}. If in addition to the hypothesis of Theorem (and ) 
we have Ai and A2 have zero measure (e.g., zero area for continuous distributions with support in r(3^3)) 
then Kn{N) = 3 would hold. 



10 Discussion and Conclusions 



In this article, we provide a probabilistic characterization of proximity maps, associated regions, and digraphs 
and related quantities. In particular, we discuss the probabilistic behavior of proximity regions, superset 
regions, and Fi-regions; construct digraphs (proximity catch digraphs(PCDs)) and investigate related quan- 
tities such as domination number, 77 and kappa values. We also provide auxiliary tools such as vertex and 
edge regions for the construction of proximity regions. 



Al thoug h Fj-regions and superset r egions were introduced before (jCevhan and Priebd (j2005[ ) ICevhan et al 
(l2006l) . and lCevhan and Prie b3 (|2007l) ) a thorough investigation is only performed in this article. Gi-regions 
are a sort of a "dual" of proximity regions and are associated with domination number being equal to 1. We 
provide a probabilistic characterization of Fi-regions for general proximity maps N and data points from 
distribution F. We also extend this concept by introducing Ffe-regions, which are associated with domination 
number being equal to k. 

We introduce the quantities related to Fi-regions and domination number, namely, 7y-values and re- 
values. ?7-value is the minimum number of points in a set required to determine the Fi-region for that set. 
We determine some general conditions that make rin{N) < 3 for data in the triangle T(3'3). k- value is the 
a.s. least upper bound for the domination number of the PCDs. We also determine some general conditions 
that make Kn{N) < 3 for data in the triangle 7(3^3). 

We pro vide two PCD f amilie s, namely proportional-edge PCDs ( Cevhan et al. ( 20061 ) and central similar- 
ity PCDs ICevhan et all (|2007[) ) as illustrative examples. We discuss the construction of proximity regions 
and Fi-regions for these PCDs. Furthermore, we calculate the expected area of Fi-regions for these PCDs 
in the limit. 
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Determining Fi-regions, 77 and k values for spherical and arc-slice PCDs contain many open problems 
and are subjects of ongoing research. 

With the above characterizations, given another PCD, then we can determine how it behaves in terms of 
Fi-regions and related quantities; in particular we can determine a.s. least upper bounds for the domination 
number of the new PCD. 
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